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(57) Abstract: The present invention relates to a 
method for screening and identifjdng compounds 
that modulate premature translation termination 
and/or nonsense-mediated messenger ribonucleic 
acid ("mRNA") by interacting with a preselected 
target ribonucleic acid ("RNA"). In particular, the 
present invention relates to identifying compounds 
that bind to regions of the 28S ribosomal RNA 
("rRNA") and analogs thereof. Direct, noncompetitive 
binding assays are advantageously used to screen 
libraries of compounds for those that selectively bind 
to a preselected target RNA. Binding of target RNA 
molecules to a particular compound is detected using 
any physical method that measures the altered physical 
property of the target RNA bound to a compound. 
The structure of the compound attached to the labeled 
RNA is also determined. The methods used will 
depend, in part, on the nature of the library screened. The methods of the present invention provide a simple, sensitive assay for 
high-throughput screening of libraries of compounds to identify pharmaceutical leads. 
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METHODS FOR mENTIFYING SMALL MOLECULES THAT 
MODULATE PREMATURE TRANSLATION TERMINATION 
AND NONSENSE MEDIATED mRNA DECAY 

This application is entitled to and claims priority benefit to U.S. Provisional 
Patent Application No. 60/398,344, filed July 24, 2002 and U.S. Provisional Patent 
AppHcation No. 60/398,332, filed My 24, 2002, both of which are incorporated herein by 
reference in their entirety. 

1. INTRODUCTION 

The present invention relates to a method for screening and identifying 
conq)ounds tbat modulate premature translation termination and/or nonsense-mediated 
messenger ribonucleic acid CmRNA") decay by interacting vjdth a preselected target 
ribonucleic acid ("RNA"). In particular, the present invention relates to methods of 
identifying compounds that bind to regions of the 28S ribosomal RNA ("rRNA") and 
analogs thereof Direct, non-competitive binding assays are advantageously used to screen 
libraries of compounds for those that selectively bind to a preselected target RNA. Binding 
of target RNA molecules to a particular compound is detected using any physical meHiod 
that measures tiie altered physical property of the target RNA bound to a compound. The 
methods of the present invention provide a simple, saisitive assay for high-througIq)ut 
screeamg of libraries of compounds to identify pharmaceutical leads. 

2. BACKGROUND OF THF, INVENTION 
Protein synfliesis encompasses the processes of translation initiation, 
elongation, and termination, each of which has evolved to occur vsdth great accuracy and 
has the capacity to be a regulated step in the pathway of gene expression. Recent studies, 
including tiiose suggesting tiiat events at termination may regulate the ability of ribosomes 
to recycle to tiie start site of flie same mRNA, have underscored the potential of teraunation 
to regulate other aspects of translation. The RNA biplets UAA, UAG, and UGA are non- 
coding and promote tiranslational termination. Termination starts when one of the three 
termination codons enters the A site of the ribosome, tiiereby signaling the polypeptide 
chain release factors to bind and recognize the termination signal. Subsequently, the ester 
bond between the 3' nucleotide of the tiansfer RNA ("tRNA") located in the ribosome's P 
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site and the nascent polypeptide chain is hydrolyzed, the completed polypeptide chain is 
released, and the ribosome subunits are recycled for another round of translation. 

Nonsense-mediated mRNA decay is a surveillance mechanism that 
minimizes the translation and regulates the stability of RNAs that contain chain termmation 
nonsense mutations (see, e,g,, Hentze & Kulozik, 1999, CeU 96:307-310; Culbertson, 1999, 
Trends in Genetics 15:74-80; Li & Wilkinson, 1998, Immunity 8:135-141; and Ruiz- 
Echevarria et al, 1996, Trends in Biological Sciences, 21 :433-438). Chain teimination 
nonsense mutations are caused when a base substitution or frameshifl mutation changes a 
codon into a termination codon, i.e., a premature stop codon that causes translational 
teraiination. In nonsense-mediated mRNA decay, mRNAs with premature stop codons are 
frequently subjected to degradation. A truncated protein is produced as a result of the 
translation apparatus prematurely terminating at the stop codon. 

Nonsense mutations cause approximately 10 to 30 percent of the individual 
cases of virtually all inherited diseases. Although nonsense mutations inhibit the synthesis 
of a full-length protein to one percent or less of wild-type levels, minimally boosting the 
e>q)ression levels of the full-length protein to between five and fifteen percent of normal 
levels can eUminate or greatly reduce the severity of disease. Nonsense suppression causes 
the read-through of a termination codon and the generation of full-length protein. Certain 
aminoglycosides have been found to promote nonsense suppression (see, e,g., Bedwell et 
al, 1997, Nat. Med. 3:1280-1284 and Howard et a/., 1996, Nat Med. 2:467-469). Clinical 
approaches that target the translation termination event to promote nonsense suppression 
have recently been described for model systems of cystic fibrosis and muscular dystrophy; 
gentamicin is an ammoglycoside antibiotic that causes translational misreadmg and allows 
the insertion of an amino acid at the site of the nonsense codon in models of cystic fibrosis, 
Hurlers Syndrome, and muscular dystrophy (see, e.g., Barton-Davis et a/., 1999, J. Clin. 
Invest 104:375-381). These results strongly suggest that drugs that promote nonsense 
suppression by altering translation tennination efficiency of a premature termination codon 
can be ther£q)eutically valuable in the treatment of diseases caused by nonsense mutations. 

Certain classes of known antibiotics have been characterized and found to 
interact with RNA. For example, the antibiotic thiostrepton binds tightly to a 60-mer fi-om 
ribosomal RNA (CundUffe et al, 1990, in The Ribosome; Structure, Function & Evolution 
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5 (Schlessinger et. al, eds.) American Society for Microbiology, Washington, D.C. pp. 479- 
490), and bacterial resistance to various antibiotics often involves methylation at specific 
rRNA sites (Cundliffe, 1989, Ann. Rev. Microbiol. 43:207-233). In addition, certain 
aminoglycosides and other protein synthesis inhibitors have been found to interact with 
specific bases in 16S rRNA (Woodcock et al, 1991, EMBO J. 10:3099-3103); moreover, 

10 an oligonucleotide analog of the 16S rRNA has been shown to interact with certain 
aminoglycosides (Purohit et al, 1994, Nature 370:659-662). Aminoglycosidic 
aminocycHtol (amiuoglycoside) antibiotics and peptide antibiotics are known to inhibit 
group I intron splicing by binding to specific regions of the RNA (von Ahsen et aLj 1991, 
Nature (London) 353:368-370). Some of these same aminoglycosides have also been found 

15 to inhibit hammerhead ribozyme function (Stage et aL, 1995, RNA 1:95-101). A molecular 
basis for hypersensitivity to aminoglycosides has been found to be located in a single base 
change in mitochondrial rRNA (Hutchin et al, 1993, Nucleic Acids Res. 21 :4174-4179). 
Aminoglycosides have also been shown to inhibit the interaction between specific structural 
RNA motifs and the corresponding RNA binding protein. Zapp et al (Cell, 1993, 74:969- 

20 978) has demonstrated that the aminoglycosides neomycin B, lividomycin A, and 

tobramycin can block the binding of Rev, a viral regulatory protein required for viral gene 
expression, to its viral recognition element in the TIB (or RRE) region of HIV RNA. This 
blockage appears to be the result of competitive binding of the antibiotics directly to the 
RRE RNA structural motif 

25 Citation or identification of any reference in Section 2 of this application is 

not an admission that such reference is available as prior art to the present invention. 

3. SUMMARY OF THE INVENTION 

The present invention provides methods for identifying compounds that 
modulate translation termination and/or nonsense-mediated mRNA decay by identifying 

30 compounds that bind to preselected target elements of nucleic acids including, but not 
limited to, specific RNA sequences, RNA structural motifi, and/or RNA structural 
elements. In particular, the present invention provides methods of identifying compounds 
that bind to regions of the 28S rRNA and analogs thereof. The specific target RNA 
sequences, RNA structural moti&, and/or RNA structural elements (/.e., regions or 

35 firagments of the 28S rRNA and analogs thereof) are used as targets for screening small 
molecules and identifying those that directly bind these specific sequences, motifs, and/or 
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structural elements. For example, methods are described in which a preselected target RNA 
having a detectable label or method of detection is used to screen a library of compounds, 
preferably under physiologic conditions; and any complexes formed between the target ' 
RNA and a member of the Kbraiy are identified using physical methods that detect the 

labeled oraltered physical property ofthetargetRNA bound to acompound. Further, 
methods are described in which a preselected target RNA is used to screen a library of 
compounds, with each conq)ound in the Ubrary havmg a detectable label or method of 
detection, preferably under physiologic conditions; and any complexes formed between the 
target RNA and a member of the hTjrary are identified usmg physical methods that detect 
the labeled or altered physical property of the compound bound to target RNA 

The present invention provides methods for identifying compounds that bind 
to a target RNA regions or fragments of 28S rRNA, or RNA contaming a premature 
stop codon), said methods comprising contacting a target RNA having a detectable label 
with a Ubrary of compounds free in solution, in, e.g., labeled tubes or microtiter plate, and 
detecting the formation of a target RNA:compound conq)lex. M particular, the present 
invention provides methods for identifying compounds that bind to a target RNA {e.g., 
regions or fragments of the 28S rRNA or RNA containing a premature stop codon), slid 
methods conq)rising contacting a target RNA having a detectable label with a library of 
compounds free m solution, in, e.g., labeled tubes or a microtiter plate, and detecting the 
formation of a target RNA;compound complex. Conq)ounds m the Ubraiy that bind to the 
labeled target RNA will form a detectably labeled complex. The detectably labeled complex 
can then be identified and removed from the uncomplexed, unlabeled complex, and from 
uncomplexed, labeled target RNA, by a variety of methods, including, but not lunited to. 
methods fliat differentiate changes in the electrophoretic, chromatographic, or themiostable 
properties of the conq)lexed target RNA. Such methods include, but are not Umited to, 
electrophoresis, fluorescence spectroscopy, surface plasmon resonance, mass spectrometry, 
scintillation proximity assay, structure-activity relationships ("SARS") by NMR 
spectroscopy, size exclusion chromatography, afiSnity chromatography, and nanoparticle 
aggregation 

The present invention provides methods for identifying compounds that bind 
to a target RNA {e.g., regions of 28S rRNA or RNA containing a premature stop codon), 
said methods comprising contacting a target RNA having a detectable label with a Ubra^ of 
compounds bound, wherein each compound in the library is attached to a soUd support, and 
detecting tire formation ofa target RNA:compound complex, hi particular, the present' 

4 
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5 invention provides methods for identifying compounds that bind to a target RNA (e.g., 
regions or fragments of 28S rRNA, or RNA containing a premature stop codon), said 
methods comprising contacting a target RNA having a detectable label with a library of 
compounds wherein each compound is attached to a solid support, (e.g., ahead-based 
library of compounds or a microarray of compounds), and detecting the formation of a 

1 0 target KNA:compound complex. Compounds in the library that bind to the labeled target 
KNA will form a detectably labeled complex. Compounds in the library that bind to the 
labeled target RNA will form a solid support detectably labeled complex (e,g,, a bead- 
based-detectably labeles complex), which can be separated from the unbound solid support, 
beads) and unbound target RNA in the liquid phase by a number of physical means, 

15 including, but not limited to, flow cytometry, affinity chK)matogr25)hy, manual batch mode 
separatioii, suspension of beads in electric fields, and nodcrowave of the bead-based 
detectably labeled complex. 

The present invention provides methods for identifying compounds that bind 
to a target RNA (e.g., regions or fragments of 28S rRNA or RNA containing a premature 

20 stop codon), said methods comprismg contacting a target KNA with a library of 

compounds, wherein each compound in the library is detectably labeled, and detecting the 
formation of a target RNAicompound complex. In particular, the present invention 
provides methods for identifying conqpounds that bind to a target KNA (e.g., regions or 
fragments of 28S rRNA, or RNA containing a premature stop codon), said methods 

25 comprising contacting a target RNA wifli a library of compounds free in solution, in 
labeled tubes or a microtiter plate, wherein each compound in the library is detectably 
labeled, and detecting the formation of a target RNAxompound complex. Compounds in 
the library that bind to the labeled target KNA will form a detectably labeled complex. The 
detectably labeled complex can then be identified and removed from the uncomplexed, 

30 unlabeled complex, and from unconq>lexed, target RNA, by a variety of methods, 
includmg, but not limited to, methods that differentiate changes in the electrophoretic, 
chromatographic, or thermostable properties of the complexed target RNA. Such methods 
include, but are not limited to, electrophoresis, fluorescence spectroscopy, surface plasmon 
resonance, mass spectrometry, scintillation proximity assay, structure-activity relationships 

35 ("SARS") by NMR spectroscopy, size exclusion chromatography, aJBEmity chromatography, 
and nanoparticle aggregation. 

The present invention provides methods for identifying compounds that bind 
to a target RNA (eg,, regions or fragments of 28S rRNA or RNA containing a premature 
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Stop codon), said methods comprising contacting a target RNA attached or conjugated to a 
solid support with a library of compounds, wherein each compound in the library is 
detectably labeled, and detecting the formation of a target RNAxompound complex. 
Target RNA molecules that bind to labeled compoimds will form a detectable labeled 
complex. Target RNA molecules that bmd to labeled compounds will form soHd support- 
detectably labeled complex, which can be separated from unbound solid support-target 
RNA and unbound labeled compounds in the liquid phase by a number of means, including, 
but not limited to, flow cytometry, afiSnity chromatography, manual batch mode separation, 
suspension of beads in electric fields, and microwave of the bead-based detectably labeled 
complex. 

In a specific embodiment, the invention provides a method for identiJfying a 
con5>ound that binds to a target RNA, said method comprising: (a) contacting a detectably 
labeled target RNA molecule with a library of compounds under conditions that permit 
direct binding of the labeled target RNA to a member of the library of compounds and the 
formation of a detectably labeled target RNA:compound complex, wherein the target RNA 
is a region or fragment of 28S rRNA, or contains a premature stop codon; and (b) detecting 
the formation of a labeled target RNAxompound complex. In another embodiment, the 
invention provides a method for identifying a compound that binds to a target RNA, said 
method comprising detecting the formation of a detectably labeled target RNAxompound 
complex formed from contacting a detectably labeled RNA with a member of a hT^rary of 
compoimds under conditions that pemiit direct binding of the labeled target RNA to a 
member of the hbraiy of compounds and the formation of a labeled target RNAxompound 
complex, wherein the target RNA is a region or fragment of 28S rRNA, or contains a 
premature stop codoiL In accordance with these embodiments, each compound in the 
library may be attached to a solid support. Non-limitmg examples of solid supports include 
a silica gel, a resin, a derivatized plastic fihn, a glass bead, cotton, a plastic bead, a 
polystyrene bead, an aluminum gel, a glass slide or a polysaccharide. 

In another specific embodiment, the invention provides a method for 
identifying a compound that binds to a target RNA, said method conq)rising: (a) contacting 
a target RNA molecule with a library of detectably labeled compounds under conditions 
that permit direct binding of the target RNA to a member of the library of labeled 
compounds and the formation of a detectable target RNAxompound complex, wherein the 
target RNA is a region or fragment of 28S rRNA, or contains a premature stop codon; and 
(b) detecting the formation of a target RNAxompound complex, hi another embodiment, 

6 
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the inventioii provides a method for identifymg a compound that binds to a target RNA, 
said method comprising detecting the formation of a target RNArcompoimd conq)lex 
fomied from contacting a RNA with a member of a library of detectably labeled compounds 
under conditions that permit direct binding of the target RNA to a member of the library of 
labeled compounds and the formation of a target RNAiconqjound complex, wherein the 
target RNA is a region or fragment of 28S rRNA, or contains a premature stop codon. In 
accordance with these embodiments, the target RNA may be attached to a solid support. 
Non-limiting exanq)les of solid siq»ports include a alica gel, a resin, a derivatized plastic 
film, a glass bead, cotton, a plastic bead, a polystyrene bead, an aluminum gel, a glass slide 
or a polysaccharide. 

In another specific embodiment, the invention provides a method for 
identifying a confound that binds to a target RNA, said method comprising: (a) contacting 
a detectably labeled target RNA molecule with a library of detectably labeled compounds 
under conditions that pamit direct binding of the labeled target RNA to a member of the 
library of labeled and the formation of a detectable target RNArcompound complex, 
wherein the target RNA is aregion or fiagment of 28S rRNA, or contains a premature stop 
codon; and (b) detecting the formation of a target RNAxompound complex. Jn another 
embodiment, the invention provides a method for identifying a compound that binds to a 
target RNA, said method comprising detecting flie formation of a target RNArcompound 
con^lex formed Sx>m contacting a labeled RNA with a member of a library of detectably 
labeled compounds under conditions that pennit direct binding of flie labeled target RNA to 
a member of flie library of labeled compounds and the formation of a target 
RNAicompound complex, wherein flie target RNA is a region or fiagment of 28S rRNA or 
contains a premature stop codon. In accordance with these embodiments, tiie target RNA 
may be attached to a solid siq)port. Non-limitmg examples of solid supports are provided 
infra. A number of techniques can be used to detect the interaction between target RNA 
and the compounds of the invention. In a specific embodiment, fluorescence resonance 
energy transfer (FRET) is used to detect the interaction between flie target RNA and the 
confound of the invention. Examples of FRET assays are known in the art and are also 
provided herein (see, e.g.. Section 5.6.2). 

The methods described herein for the identification of compounds that 
directly bind to 28S rRNA ot a RNA containing a premature stop codon are well suited for 
high-throughput screening. The direct binding method of the invention offers advantages 
over drug screening systems for competitors that inhibit the formation of natuially- 

7 
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5 occuiring RNA binding protein:target RNA complexes; ie., con^etitive assays. The direct 
binding method of the invention is rapid and can be set op to be readily perfomed, e.g., by 
a technician, making it amenable to high-througj^ut screening. The mettiods of the 
invention also eliminate the bias inherent in the conq)etitive drug screening systems, which 
require the use of a preselected host ceU fector that may not have physiological relevance to 

10 the activity of the target RNA. Instead, the methods of the invention are used to identify 
any compound that can directly bind to a target RNA, {eg., 28S rKNA or a RNA containing 
a premature stop codon), preferably under physiologic conditions. As a result, the 
compounds so identified can inhibit the interaction of the target RNA with any one or more 
of the native host ceU factors (whether known or unknown) required for activity of the RNA 

15 in vivo. 

The compounds utilized in the assays described herein may be members of a 
library of conq)ounds. In specific embodiment, the compound is selected from a 
combinatorial Hbrary of compounds comprising peptides; random biooUgomers; 
diversomers such as hydantoins, benzodiazepines and dipeptides; vinylogous polypeptides; 
20 nonpeptidal peptidomimetics; oligocarbamates; peptidyl phosphonates; peptide nucleic acid 
Ubraries; antibody Ubraries; carbohydrate Kbraries; and small organic molecule libraries. In 
a preferred embodiment, the small organic molecule libraries are libraries of 
benzodiazepines, isoprenoids, thiazoUdinones, metatfaiazanones, pyiroUdines, moipholino 
compounds, or diazqjindiones. 

25 In certain embodiments, the compounds are screened in pools. Once a 

positive pool has been identified, the individual compounds of that pool are tested 
separately. Li certain embodunents, the pool size is at least 2, at least 5, at least 10, at least 
25. at least 50. at least 75. at least 100, at least 150, at least 200, at least 250, or at least 500 
compounds. 

Once a compound is identified in accordance with the mvention, the 
structure of the compound may be determined utilizing weU-known techniques or by 
referring to a predetermined code. The methods used will depend, in part, on the nature of 
the library screened. For example, assays of microarrays of compounds, each having an 
address or identifier, may be deconvoluted, e.g., by cross-referencing the positive sample to 
original compound list that was applied to the individual test assays. Another method for 
identifying compounds includes de novo siructure determination of the coiiq)ounds using, 
for example, mass spectrometry or nuclear magnetic resonance C*NMR"). The compounds 
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5 identified are useful for any purpose to which a binding reaction may be put, for example in 
assay methods, diagnostic procedures, cell sorting, as inhibitors of target molecule function, 
as probes, as sequestering agents and the like. In addition, small organic molecules which 
interact specifically with target RNA molecules may be useful as lead compounds for the 
development of therapeutic agents. 

10 A compound identified in accordance with the methods of the invention may 

bind to a premature stop codon. A compound identified in accordance with the methods of 
invention may also disrupt an iateraction between a premature stop codon and the mRNA 
translation machinery. In a preferred embodiment, a compound identified in accordance 
with the methods of the invention binds to RNA and suppresses premature translation 

15 temiination and/or nonsense-mediated mRNA decay of a gene encoding a protein, 

polypeptide or peptide whose expression is beneficial to a subject In another preferred 
embodiment, a compound identified in accordance with the methods of the invention binds 
to RNA and increases premature translation termmation and/or nonsense-inediated mRNA 
decay of a gene encodmg a protein, polypeptide or peptide whose expression is detrimental 

20 to a subject. In a specific embodiment, a compound identified in accordance with the 
methods of the invention preferentially or differentially modulates premature translation 
termination and/or nonsense-mediated mRNA decay of a specific nucleotide sequence of 
interest relative to another nucleotide sequence. 

In certain embodiments of the invention, the compound identified using the 

25 assays described herein is a small molecule. In a preferred embodiment, the compoxmd 
identified using the assays described herein is not known to affect premature translation 
termination and/or nonsense-mediated mRNA decay of a nucleic acid sequence, in 
particular a nucleic acid sequence of interest. In another preferred embodiment, the 
compound identified using the assays described herein has not been used as or suggested to 

30 be used in the prevention, treatment, management and/or amelioration of a disorder 

associated with, characterized by or caused by a premature stop codon. In another preferred 
embodiment, the compound identified usmg the assays described herein has not been used 
as or suggested to be used in the prevention, treatment, management and/or amelioration of 
a particular disorder described herein. 

35 A compound identified in accordance with the methods of the invention 

may be tested in in vitro and/or in vivo assays well-known to one of skill in the art or 
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described herein to detennine (he prophylactic or therapeutic effect of a particular 
compound for aparticular disorder. In particular, a compound identified utilizing the 
assays described herein may be tested in an ammal model to determine the efficacy of the 
compound in the prevention, treatment or amelioration of a disorder associated with, 
characterized by or caused by a premature stop codon, or a disorder described herein, or a 
symptom thereof. In addition, a compound identified utilizing the assays described herem 
may be tested for its toxicity in in vitro and/or in vivo assays weU-known to one of skiU in 
the art. Further, a compound identified as binding to a target RNA utilizing assays 
described herein or those weU-known in the art may be tested for its abihty to modulate 
premature translation and/or nonsense mediated mRNA decay. 

In a specific embodiment, the invention provides a method for identifying a 
compound to test for its ability to modulate premature translation termination or nonsense- 
mediated mSNA decay, said method comprising: (a) contacting a detectably labeled target 
RNA molecule with a hbrary of compounds under conditions that permit direct bmding of 
the labeled target RNA to a member of the Ubrary of compounds and the formation of a 
detectably labeled target RNArcompound complex, wherein the target RNA is a region or 
fragment of 28S rRNA or contains a premature stop codon; and (b) detectmg a detectably 
labeled target RNArcompound complex fomied in step(a), so that if a target 
RNArcompound complex is detected then the compound identified is tested for its abihty to 
modulate premature translation or nonsense-mediated mRNA decay. 

hi a specific embodunent, the invention provides a method for identifying a 
compound to test for its abiUty to modulate premature translation termination or nonsense- 
mediated mRNA decay, said method comprismgr (a) contacting a target RNA molecule 
with a hbrary of detectably labeled compounds under conditions that pennit direct bmdmg 
of the target RNA to a member of the Ubrary of labeled compounds and the formation of a 
detectably labeled target RNArcompound complex, wherein the target RNA is a region or 
fiagment of 28S rRNA or contams a premature stop codon; and (b) detecting a detectably 
labeled target RNArcompound complex formed m step(a), so that if a target 
RNArconqjound conqilex is detected then the compound identified is tested for its abihty to 
modulate premature translation or nonsense-mediated mRNA decay. 



10 
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5 The invention provides cell-based and cell-free assays to test the ability of a 

compound identified in accordance with the methods of the invention to modulate 
premature translation termination and/or nonsense-mediated mRNA decay. In particular, 
the invention provides cell-based and cell-free reporter assays for the identification of a 
compound that modulates premature translation termination and/or nonsense-mediated 

10 mRNA decay. In general, the level of expression and/or activity of a reporter gene product 
in the reporter gene based-assays described herein is indicative of the effect of the 
compound on premature translation termination and/or nonsense-mediated mRNA decay. 
The reporter gene-based assays described herein for the identification of compounds that 
modulate premature translation termination and/or nonsense-mediated mRNA decay are 

1 S well suited for high-throughput screening. 

The reporter gene cell-based assays may be conducted by contacting a 
compound with a cell containing a nucleic acid sequence comprising a reporter gene, 
wherein the rq)orter gene contains a premature stop codon or nonsense mutation, and 
measimng the expression of the reporter gene. The reporter gene cell-free assays may be 

20 conducted by contacting a compound with a cell-firee extract and a nucleic acid sequence 
comprising a reporter gene, wherein the reporter gene contains a premature stop codon or 
nonsense mutation, and measuring the expression of the reporter gene. In the cell-based 
and cell-free reporter gene assays described herein, the alteration in reporter gene 
expression or activity relative to a previously detennined reference range, or to the 

25 expression or activity of the reporter gene in the absence of the compound or the presence 
of an appropriate control {e.g,, a. negative control) indicates that a particular compound 
modulates premature translation termination and/or nonsense-mediated mRNA decay. In 
particular, an increase in reporter gme expression or activity relative to a previously 
determined reference range, or to the expression in the absence of the compound or the 

30 presence of an appropriate control (e.g., a negative control) may, depending upon the 
parameters of the reporter gene assay, indicate that a particular compound reduces or 
suppresses premature translation termination and/or nonsense-mediated mRNA decay 
increases nonsense suppression). In contrast, a decrease in reporter gene expression or 
activity relative to a previously detennined reference range, or to the expression in the 

35 absence of the compound or the presence of an appropriate control (e,g,, a negative control) 
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may, depending upon the parameters of the reporter gene-based assay, indicate that a 
particular compound enhances premature translation termination and/or nonsense-mediated 
mRNA decay (i.e., decreases nonsense suppression). 

In a specific embodiment, the invention provides a method of identifying a 
compound that modulates premature translation termination or nonsense-mediated mRNA 
decay, said method comprising: (a) contacting a detectably labeled target KNA molecule 
with a Ubrary of compounds under conditions that permit direct binding of the labeled target 
RNA to amember of the library of compounds and the formation of a detectably labeled 
target KNAxompound complex, wherein the target RNA is a region of 28S rKNA or 
contains apremature stop codon; (b) detecting a labeled target RNArcompound complex 
formed in step(a); so that if a target RNA:compound complex is detected, then (c) 
contacting the compound with a cell-free translation mixture and a nucleic acid sequence 
comprising a regulatory element operably linked to a reporter gene, wherein the reporter 
gene contains a premature stop codon; and (d) detecting the expression of die reporter gene, 
wherein a compound that modulates premature translation tennination or nonsense- 
mediated mKNA decay is identified if the expression of the reporter gene in the presence of 
the compound is altered relative to the expression of the reporter gene in the absence of the 
compound or the presence of a negative control. In accordance with Ms embodiment, each 
compound in the library may be attached to a solid siqjport 

in another embodiment, the invention provides a metlwd of identifying a 
compound that modulates pranature translation termination or nonsense-mediated mRNA 
decay, said method comprising: (a) contacting a detectably labeled target RNA molecule 
with a Iftrary of compounds under conditions that permit direct binding of the labeled target 
RNA to a member of the library of compounds and the formation of a detectably labeled 
target RNArcompound complex, wherein the target RNA is a region of 28S rRNA or 
contains apremature stop codon; (b) detecting a labeled target RNArcompound complex 
formed in step(a); so that if a target RNA compound complex is detected, then (c) 
contacting the compound with a ceU containing a nucleic acid sequence comprising a 
regulatory element operably linked to a reporter gene, wherein the reporter gene contains a 
premature stop codon; and (d) detecting the expression of the reporter gene, wherein a 
compound that modulates premature translation termination or nonsense-mediated mRNA 
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decay is identified if the expression of the reporter gene in the presence of the compound is 
altered relative to the expression of the reporter gene in the absence of the compound or the 
presence of a negative control. In accordance with this embodiment, each compound in the 
library may be attached to a solid svqjport. 

In a specific embodiment, the invention provides a method of identifying a 
compound that modulates premature translation teimination or nonsense-mediated mKNA 
decay, said method comprising: (a) contacting a detectably labeled target RNA molecule 
with a library of compounds under conditions that permit direct binding of the labeled target 
RNA to a member of the library of compounds and the formation of a detectably labeled 
target RNAxompound complex, wherein the target KNA is a region of 28S rKNA or 
contams a premature stop codon; (b) detecting a labeled target RNAxompound complex 
formed in stq)(a); then (c) contacting the compound with a cell-fi-ee translation mixture and 
a nucleic acid sequence comprising a regulatory element operably linked to a reporter gene, 
wherein flie reporter gene contains a premature stop codon; and (d) detecting the expression 
of the reporter gene, wherem a compound that modulates premature hmslation teimination 
or nonsense-mediated mKNA decay is identified if the expression of flie reporter gene in tbe 
presence of the compound is altered relative to the expression of the reporter gene in the 
absence of the compound or the presence of a negative control. In accordance with this 
embodiment, each compound in the library may be attached to a soUd support 

In another embodiment, the invention provides a method of identifying a 
compound that modulates premature translation termination or nonsense-mediated mRNA 
decay, said method comprising: (a) contacting a detectably labeled target RNA molecule 
with a Ubrary of compounds under conditions tiiat permit direct binding of the labeled target 
RNA to a member of the library of compounds and the formation of a detectably labeled 
target RNA:compound con^lex, wherein the target RNA is a region of 28S rRNA or 
contains a premature stop codon; (b) detecting a labeled target RNA:compound complex 
formed in step(a); then (c) contacting the compound witii a cell containing a nucleic acid 
sequence conq)rising a regulatory element operably linked to a reporter gene, wherem the 
reporter gene contains a premature stop codon; and (d) detecting the expression of the 
reporter gene, wherein a compound that modulates prematine translation termination or 
nonsense-mediated mRNA decay is identified if the expression of the reporter gene in the 
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.5 presence of the compound is altered relative to the expression of the reporter gene in the 
absence of the compound or the presence of a negative control. In accordance with this 
embodiment, each compound in the Kbrary may be attached to a soUd support. 

In a specific embodiment, the invention provides a method of identifying a 
conq)ound that modulates premature translation termination or nonsense-mediated mRNA 
10 decay, said method comprising: (a) contacting a target RNA molecule with a library of 

detectably labeled compounds under conditions that pemiit direct binding of the target RNA 
to a member of the library of labeled compounds and the formation of a detectably labeled 
target KNTAxompound complex, wherein the target RNA is a region of 28S rRNA or 
contains a premature stop codon; (b) detecting a labeled target RNAcompound complex 
formed in step(a); then (c) contacting the compound with a cell-free translation mixture and 
a nucleic add sequence comprising a regulatory element operably linked to a reporter gene, 
wherein the reporter gene contains a premature stop codon; and (d) detecting the expression 
of the reporter gene, wherein a compound that modulates premature translation termination 
or nonsense-mediated mRNA decay is identified if the expression of the reporter gene in the 
20 presence of the compound is altered relative to the expression of the reporter gene in the 
absence of the compound or the presence of a negative control. In accordance with this 
embodiment, the target RNA may be attached or conjugated to a soUd support, or detectably 
labeled. 

In another embodiment, the invention provides a mefliod of identifying a 
25 compound that modulates premature translation temiination or nonsense-mediated mRNA 
decay, said method comprising: (a) contacting a target RNA molecule with a library of 
detectably labeled compounds under conditions that permit direct binding of the target RNA 
to a member of the Ubrary of labeled compounds and the formation of a detectably labeled 
target RNA:compound complex, wherein the target RNA is a region of 28S rRNA or 
contains a premature stop codon; (b) detecting a labeled target RNArcompound complex 
farmed in stq)(a); then (c) contacting the compound with a ceU containing a nucleic acid 
sequence comprising a regulatory element operably linked to a reporter gene, wherein the 
reporter gene contains apremature stop codon; and (d) detecting the expression of the 
reporter gene, wherein a compound that modulates premature tiranslation termination or 
35 nonsense-mediated mRNA decay is identified if the expression of the reporter gene in the 
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presence of the compound is altered relative to the expression of the reporter gene in the 
absence of the compound or the presence of a negative control. In accordance with this 
embodiment, the target RNA may be attached or conjugated to a solid support, or detectably 
labeled. 

In a specific embodunent, the invention provides a method of identifying a 
compound that modulates premature translation termination or nonsense-mediated mRNA 
decay, said method comprising: (a) contacting a target RNA molecule with a library of 
detectably labeled compounds under conditions that permit direct binding of the target RNA 
to a member of the hbrary of labeled compounds and the formation of a detectably labeled 
target RNArcompound complex, wherein the target RNA is a region of 28S rRNA or 
contains a premature stop codon; (b) detecting a labeled target RNArcompound complex 
formed in step(a); so that if a target RNArcompound complex is detected, then (c) 
contacting the compound with a cell-jfree translation mixture and a nucleic acid sequence 
comprising a regulatory element operably linked to a reporter gene, wherein the reporter 
gene contains a premature stop codon; and (d) detecting the expression of the reporter gene, 
wherein a compound that modulates premature translation termination or nonsense- 
mediated mRNA decay is identified if the expression of the reporter gene in the presence of 
the compound is altered relative to the expression of the reporter gene in the absence of the 
compound or the presence of a negative control. In accordance with this embodiment, the 
target RNA may be attached or conjugated to a solid support, or detectably labeled. 

In another embodiment, the invention provides a method of identifying a 
compound that modulates premature translation termination or nonsense-mediated mRNA 
decay, said method comprising: (a) contacting a target RNA molecule with a library of 
detectably labeled compounds under conditions that permit direct binding of the target RNA 
to a member of the library of labeled compounds and the formation of a detectably labeled 
target RNArcompound complex, wherein the target RNA is a region of 28S rRNA or 
contains a premature stop codon; (b) detectmg a labeled target RNArcompound complex 
formed in step(a); so that if a target RNArcompound complex is detected, then (c) 
contacting the compound with a cell containing a nucleic acid sequence comprising a 
regulatory element operably linked to a reporter gene, wherein the reporter gme contains a 
premature stop codon; and (d) detecting the expression of the reporter gene, wherein a 
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compound that modulates premature translation termination or nonsense-mediated mRNA 
decay is identified if Ihe expression of the reporter gene in the presence of the compound is 
altered relative to the ejcpression of the rq)orter gene in the absence of the compound or the 
presence of a negative control. In accordance with this embodiment, the target RNA may 
be attached or conjugated to a solid support, or detectably labeled. 

The invention provides methods for preventing, treating, managing or 
ameliorating a disorder associated with, characterized by or caused by a premature 
translation termination and/or nonsense-mediated mKNA decay or a symptom thereof, said 
method comprising administering to a subject in need thereof a therapeutically or 
prophylactically effective amount of a compound, or a phannaceutically acceptable salt 
thereof identified according to the mefliods described herein. 

The present invention may be understood more fiilly by reference to the 
detailed description and examples, which are intended to illustrate non-limiting 
embodiments of the invention. 

3.1 Terminology 

As used herein, the temi "compound" refers to any agent or complex that is 
bemg tested for its ability to interact with a target nucleic acid (in particular, a target KNA) 
or has been identified as interacting with a target nucleic acid (in particular, a target RNA). 

As used herein, the terms "disorder" and "disease" are to refer to a condition 

in a subject. 

As used herein, a "dye" refers to a molecule that, when exposed to radiation, 
emits radiation at a level that is detectable visually or via conventional spectroscopic means. 
As used herein, a ♦'visible dye" refers to a molecule having a chromophore that absorbs 
radiation in the visible region of the spectrum (i.e., having a wavelength of between about 
400 nm and about 700 nm) such that the transmitted radiation is in the visible region and 
can be detected either visually or by conventional spectroscopic means. As used herein, an 
"ultraviolet dye" refers to a molecule having a chromophore that absorbs radiation in the 
ultraviolet region of the spectrum {i.e., having a wavelength of between about 30 nm and 
about 400 nm). As used herein, an "infi-ared dye" refers to a molecule having a 
chromophore that absorbs radiation in the infiared region of the spectrum {i.e., having a 
wavelength between about 700 nm and about 3,000 nm). A "chromophore" is the network 
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of atoms of the dye that, when exposed to radiation, emits radiation at a level that is 
detectable visually or via conventional spectroscopic means. One of skill in the art will 
readUy ^preciate that although a dye absorbs radiation in one region of the spectrum, it 
may emit radiation m another region of the spectrum. For example, an ultraviolet dye may 
emit radiation in the visible region of the spectrum. One of skiU m the art will also readily 
appreciate that a dye can transmit radiation or can emit radiation via fluorescence or 
phosphorescence. 

As used herein, the term "effective amount" refers to the amoimt of a 
compound which is sufficient to (i) reduce or ameliorate the progression, severity and/or 
duration of a disorder {e.g., a disorder associated with, characterized by or caused by 
premature translation termination and/or nonsense-mediated mRNA decay), or one or more 
symptoms hereof, (ii) prevent the development, recurrence or onset of a disorder {e.g., a 
disorder associated with, characterized by or caused by premature translation teraiination 
and/or nonsoise-mediated mRNA decay), or one or more symptoms thereof (iii) prevent 
the advancement of a disorder a disorder associated with, characterized by or caused 
by premature translation termination and/or nonsense-mediated mRNA decay), or one or 
more symptoms thereof, or (iv) enhance or unprove the therapeutic e£fect(s) of another 
therapy. 

As used herein, the term "fragment", m the context of a protein or 
polypeptide refers to a peptide sequence of at least 5 contiguous residues, at least 10 
contiguous residues, at least 15 contiguous residues, at least 20 contiguous residues, at least 
25 contiguous residues, at least 40 contiguous residues, at least 50 contiguous residues, at 
least 60 contiguous residues, at least 70 contiguous residues, at least 80 contiguous residues, 
at least 90 contiguous residues, at least 100 contiguous residues, at least 125 contiguous 
residues, at least 150 contiguous residues, at least 175 contiguous residues, at least 200 
contiguous residues, or at least 250 contiguous residues of tiie sequence of anolher protein 
or polypeptide. In a specific embodiment, a fi:agment of a protein or polypeptide retains at 
least one fimction of the protein or polypeptide. 

As used herein, the term "fragment", in the context of a nucleic acid 
sequence refers to a nucleotide sequence of at least 5 contiguous bases, at least 10 
contiguous bases, at least 15 contiguous bases, at least 20 contiguous bases, at least 25 
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5 contiguoxis bases, at least 40 contiguous bases, at least 50 contiguous bases, at least 60 
contiguous bases, at least 70 contiguous liases, at least 80 contiguous bases, at least 90 
contiguous bases, at least 100 contiguous bases, at least 125 contiguous bases, at least 150 
contiguous bases, at least 175 contiguous bases, at least 200 contiguous bases, or at least 
250 contiguous bases of the sequence of another nucleic acid sequence. In a specific 

10 embodiment, a fragment of a nucleic acid sequence retains at least one domain of the 
nucleic acid sequence. 

As used herein, the tenn "in combination" refers to the use of more than one 
therapy {e.g., prophylactic and/or therapeutic agents). The use of the term "m combination" 
does not restrict the order in which therapies prophylactic and/or therapeutic agents) 

15 are administered to a subject with a disorder (eg., a disorder associated with, characterized 
by or caused by premature translation termination and/or nonsense-mediated mRNA 
decay). A first therapy a prophylactic or therapeutic agent such as a compound 
identified in accordance with the methods of the invention) can be administered prior to 
(eg., 5 minutes, 15 minutes, 30 minutes, 45 minutes, 1 hour, 2 hours, 4 hours, 6 hours, 12 

20 hours, 24 hours, 48 hours, 72 hours, 96 hours, 1 week, 2 weeks, 3 weeks, 4 weeks, 5 weeks, 
6 weeks, 8 weeks, or 12 weeks before), concomitantly with, or subsequent to (e.g., 5 
minutes, 15 minutes, 30 minutes, 45 minutes, 1 hour, 2 hours, 4 hours, 6 hours, 12 hours, 24 
hours, 48 hours, 72 hours, 96 hours, 1 week, 2 weeks, 3 weeks, 4 weeks, 5 weeks, 6 weeks, 
8 weeks, or 12 weeks after) the administration of a second therapy (e^., a prophylactic or 

25 therapeutic agent such as a chemotherapeutic agent or a TNF-a antagonist) to a subject with 
a disorder (eg., a disorder associated with, characterized by or caused by premature 
translation termination and/or nonsense-mediated mRNA decay). 

As used herein, a "label" or "detectable label" is a composition that is 
detectable, either directly or indirectly, by spectroscopic, photochemical, biochemical, 

30 immunochemical, or chemical means. For example, useful labels include radioactive 
isotopes (e.g., ^^P, ^^S, and ^H), dyes, fluorescent dyes, electron-dense reagents, enzymes 
and their substrates (e.g., as commonly used in enzyme-linked immimoassays, eg., alkaline 
phosphatase and horse radish peroxidase), biotin, streptavidin, digoxigenin, or haptens and 
proteins for which antisera or monoclonal antibodies are available. Moreover, a label or 

35 detectable moiety can include an "affinity tag" that, when coupled with the target nucleic 
acid and incubated with a compound or compound library, allows for the affinity capture of 
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the target nucleic acid along with molecules bound to the target nucleic acid. One skilled in 
the art will appreciate that an affinity tag bound to the target nucleic acids has, by 
definition, a complimentary ligand coupled to a solid support that allows for its capture. 
For example, useful afiBnity tags and complimentary ligands or partners include, but are not 
limited to, biotm-streptavidin, complimentary nucleic acid fragments (e.g., oligo dT-oligo 
dA, oKgo T-oUgo A, oKgo dG-oUgo dC, oKgo G-oKgo Q, aptamer complexes, aptamere, or 
haptens and proteins for which antisera or monoclonal antibodies are available. The label 
or detectable moiety is typically bound, either covalently, through a linker or chemical 
bound, or through ionic, van der Waals or hydrogen bonds to the molecule to be detected. 

As used herein, a "library"' in the context of compounds refers to a plurality 
of compounds with which a target nucleic acid molecule is contacted. A library can be a 
combinatorial library, e.g,, a collection of compounds synthesized using combinatorial 
chemistry techniques, or a collection of unique chemicals of low molecular weight (less 
than 1000 daltons) that each occupy a unique three-dimensional space. 

As used herein, the terms "manage", "managing" and "management" refer to 
the beneficial effects that a subject derives fi-om a therapy (e.g., a prophylactic or 
therapeutic agent) which does not result in a cure of the disorder, (eg., a disorder associated 
with, characterized by or caused by premature translation teraiination and/or nonsense- 
mediated mRNA decay). In certain embodiments, a subject is administered one or more 
therapies to '^manage" a disease or disorder so as to prevent flie progression or worsening of 
the disease or disorder. 

As used herein, the phrase ^'modulation of premature translation termination 
and/or nonsense-mediated mRNA decay*' refers to the regulation of gene expression by 
altering the level of nonsense suppression. For example, if it is desirable to increase 
production of a defective protein encoded by a gene with a premature stop codon, i.e., to 
permit read through of the premature stop codon of the disease gene so translation of the 
gene can occur, then modulation of premature translation termmation and/or nonsense- 
mediated mRNA decay entails up-regulation of nonsense suppression. Conversely, if it is 
desirable to promote the degradation of an mRNA with a premature stop codon, then 
modulation of premature translation termination and/or nonsense-mediated mRNA decays 
entails down-regulation of nonsense suppression. 
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5 As used herein, the tenns **non-responsive" and **refractory describe 

patients treated with a currently available therapy (e,g,, prophylactic or therapeutic agent) 
for a disorder (e.g., a disorder associated with, characterized by or caused by premature 
translation termination and/or nonsense-mediated mKNA decay such as, cancer), 
which is not clinically adequate to reUeve one or more symptoms associated with such 

10 disorder. Typically, such patients suffer from severe, persistently active disease and require 
additional therapy to ameliorate the symptoms associated with their disorder. 

As used herein, "nonsense-mediated niRNA decay** refers to any mechanism 
that mediates the decay of niRNAs containing a premature translation termination codon. 

As used herein, a '"nonsense mutation" is a point mutation changing a codon 

1 5 corresponding to an amino acid to a stop codon. 

As used herein, **nonsense suppression" refers to the inhibition or 
suppression of premature translation tamination and/or nonsense-mediated mKNA decay- 
As used herein, the phrase '^harmaceutically acceptable salt(s)" includes but 
is not limited to salts of acidic or basic gFoiq)s that may be present in compounds identified 

20 using the methods of the present invention. Compounds that are basic in nature are capable 
of fonning a wide variety of salts with various inorganic and organic acids. The acids that 
can be used to prepare pharmaceutically acceptable acid addition salts of such basic 
compounds are those that form non-toxic acid addition salts, i.e., salts containing 
pharmacologically acceptable anions, including but not limited to sulfuric, citric, maleic, 

25 acetic, oxalic, hydrochloride, hydrobromide, hydroiodide, nitrate, sulfate, bisulfate, 

phosphate, acid phosphate, isonicotinate, acetate, lactate, sahcylate, citrate, acid citrate, 
tartrate, oleate, tannate, pantoth^ate, bitartrate, ascorbate, succinate, maleate, gentisinate, 
fumarate, gluconate, glucaronate, saccharate, formate, benzoate, glutamate, 
methanesulfonate, ethanesulfonate, benzenesulfonate, p-toluenesulfonate and pamoate (z.e., 

30 l,r-methylene-bis-(2-hydroxy-3-naphthoate)) salts. Compounds that include an amino 
moiety may form pharmaceutically or cosmetically accq)table salts with various amino 
acids, in addition to the acids mentioned above. Compounds that are acidic in nature are 
enable of forming base salts with various pharmacologically or cosmetically acceptable 
cations. Examples of such salts include alkali metal or alkaline earth metal salts and, 

35 particularly, calcium, magnesixmi, sodium lithium, zinc, potassium, and iron salts. 
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As Tised herein, the tenn "previously deteraiined reference range" refers to a 
reference range for the readout of a particular assay. Each laboratory wiU estabUsh its own 
reference range for each particular assay In a preferred embodiment, at least one positive 
control and at least one negative control are included in each batch of compounds analyzed. 

As used herein, a "premature termination codon" or "premature stop codon" 
refers to the occurrence of a stop codon instead of a codon corresponding to an amino acid. 

As used herein, '^premature translation termination" refers to the result of a 
mutation that changes a codon corresponding to an amino acid to a stop codon. 

As used herein, the terms "prevent", " preventing" and '^prevention" refer to 
the prevention of the development, recurraice or onset of a disorder (e.g., a disorder 
associated with, characterized by or caused by premature translation termination and/or 
nonsense-mediated mRNA decay) or one or more symptoms thereof resulting fiom the 
administration of one or more conqjounds identified in accordance flie methods of the 
invention or the administration of a combination of such a compound and a known therapy 
for such a disorder. 

As used her&n, the terns "prophylactic agent" and "prophylactic agents" 
refer to any agent(s) which can be used in the prevention of a disorder (e.g., a disorder 
associated with, characterized by or caused by premature translation termination and/or 
nonsense-mediated mRNA decay). In certain embodiments, the term "prophylactic agent" 
refers to a compound identified in the screening assays described herein. In certain other 
embodiments, the term "prophylactic agent" refers to an agent other than a compound 
identified in the screening assays described herein which is known to be useful for, or has 
been or is currently being used to prevent or unpede the onset, development and/or 
progression of a disorder (e.g., a disorder associated with, characterized by or caused by 
premature translation termination and/or nonsense-mediated mKNA decay) or one or more 
symptoms thereof. 

As used herem, the phrase "prophylactically effective amount" refers to the 
amount of atherapy (e.g., aprophylactic agent) which is sufficient to result in the 
prevention of the development, recurrence or onset of one or more symptoms associated 
with a disorder (e.g., a disorder associated with, characterized by or caused by premature 
translation temiination and/or nonsense-mediated mRNA decay). 
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5 As used herein, the term "purified," m the context of a compound, a 

compound identified in accordance with the method of the inv^tion, refers to a compound 
that is substantially free of chemical precursors or other chemicals when chemically 
synthesized. In a specific embodiment, the compound is 60%, preferably 65%, 70%, 75%, 
80%, 85%, 90%, or 99% fi-ee of other, different compounds. In a preferred embodiment, a 

10 compound identified ia accordance with the methods of the invention is purified. 

As used herein, the terai "reporter gene" refers to a nucleotide sequence 
encoding a protein, polypeptide or peptide that is readily detectable eiflier by its presence or 
activity. Any reporter gene well-known to one of skill in the art may be used in reporter 
gene constructs to ascertain the effect of a compound on premature translation termination. 

IS As used herein, the term "small molecule" and analogous temis include, but 

are not limited to, peptides, peptidomimetics, amino acids, amino acid analogs, 
polynucleotides, polynucleotide analogs, nucleotides, nucleotide analogs, organic or 
inorganic compounds (f.^., including heterorganic and/or ganometaUic compounds) having 
a molecular weight less than about 10,000 grams per mole, organic or inorganic compounds 

20 havmg a molecular weight less than about 5,000 grams per mole, organic or inorganic 
compounds having a molecular weight less than about 1,000 grams per mole, organic or 
inorganic compounds having a molecular weight less than about 500 grams per mole, and 
salts, esters, and other pharmaceutically acceptable forms of such compounds. 

As used herein, tihie terms "subject" and '*patienf' are used interchangeably 

25 herein. The terms "subject" and "subjects" refer to an animal, preferably a mammal 
including a non-primate (e.g., a cow, pig, horse, cat, dog, rat, and mouse) and a primate 
(e.g;, a chimpanzee, a monkey such as a cynomolgous monkey and a human), and more 
preferably a human. In one embodiment, the subject is refiractory or non-responsive to 
current therapies for a disorder (e.g., a disorder associated with, characterized by or caused 

30 by premature translation termination and/or nonsense-mediated mRNA decay). In another 
embodiment, the subject is a farm animal a horse, a cow, a pig, etc.) or a pet (eg., a 
dog or a cat). In a preferred embodiment, the subject is a human. 

As used herein, the term "synergistic" refers to a combination of a 
compound identified using one of the methods described herein, and another therapy (e.g., a 

35 prophylactic or therapeutic agent), which combination is more effective than the additive 
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5 effects of the therapies. A synergistic effect of a combinatioii of therapies 

prophylactic or therapeutic agents) pennits the use of lower dosages of one or more of the 
therapies and/or less frequent administration of said therapies to a subject with disorder 
(e,g., a disorder associated with, characterized by or caused by premature translation 
termination and^or nonsense-mediated mRNA decay). The ability to utilize lower dosages 

10 of a therapy (e.g., a prophylactic or therapeutic agent) and/or to administer said therapy less 
frequently reduces the toxicity associated with the adnunistration of said therapy to a 
subject without reducing the efficacy of said therapies in the prevention, treatment, 
management or amelioration of a disorder (e.g.y a disorder associated with, characterized by 
or caused by premature translation termination and/or nonsense-mediated mRNA decay). 

15 In addition, a synergistic effect can result in improved efficacy of therapies (e.g., 
prophylactic or therapeutic agents) in the prevention, treatment, management or 
amelioration of a disorder (e.g., a disorder associated with, characterized by or caused by 
premature translation termination and/or nonsense-mediated mRNA decay). Finally, a 
synergistic effect of a combination of therapies prophylactic or therapeutic agents) 

20 may avoid or reduce adverse or unwanted side effects associated with the use of either 
therapy alone. 

As used herein, the term "substantially one type of compound" means that 
the assay can be performed in such a fashion that at some point, only one compound need 
be used in each reaction so that, if the result is indicative of a binding event occurring 
25 between the target RNA molecule and the compound, the compound can be easily 
identified 

As used herein, a **target nucleic acid" refers to RNA, DNA, or a chemically 
modified variant thereof. Inapreferredembodiment, the target nucleic acid is RNA. A 
target nucleic acid also refers to tertiary structures of the nucleic acids, such as, but not 
30 limited to loops, bulges, pseudoknots, guanosine quartets and turns. A target nucleic acid 
also refers to RNA elements such as, but not linoited to, 28S rRNA and structural analogs 
thereof, which are described in Sections 5.1 and 5.2. Non-limiting examples of target 
nucleic acids are presented in Sections 5.1 and 5.2. 

As used herein, a *target RNA" refers to RNA or a chemically modified 
35 variant thereof. A target RNA also refers to tertiary structures of RNA, such as, but not 
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5 limited to loops, bulges, pseudoknots, guanosine quartets and turns. A target RNA also 
refers to RNA elements such as, but not limited to, 28S rKMA and structural analogs 
thereof, which are described in Sections 5.1 and 5.2. Non-limiting examples of target 
RNAs are presented in Sections 5. 1 and 5.2. In a specific embodiment, a target RNA is at 
least 25 nucleotides, preferably at least 30 nucleotides, at least 35 nucleotides, at least 40 

10 nucleotides, at least 45 nucleotides, at least 50 nucleotides, at least 55 nucleotides, at least 
60 nucleotides, at least 65 nucleotides, at least 70 nucleotides, at least 75 nucleotides, at 
least 80 nucleotides, at least 85 nucleotides, at least 90 nucleotides, at least 95 nucleotides, 
at least 100 nucleotides, at least 125 nucleotides, at least 150 nucleotides, at least 175 
nucleotides or at least 200 nucleotides in length. 

15 As used herein, the terms '*ther25)eutic agent" and **ther^eutic agents" refer 

to any agent(s) which can be used in the prevention, treatment, management or amelioration 
of one or more symptoms of a disorder (e.g,, a disorder associated witih, characterized by or 
caused by premature translation termination and/or nonsense-mediated mRNA decay). In 
certain embodiments, the term "'therapeutic agent" refers to a compound identified in the 

20 screening assays described herein. In other embodiments, the term ^"therapeutic agent" 

refers to an agent other than a compoimd identified in the screening assays described herein 
which is known to be usefid for, or has been or is currently being used to prevent, treat, 
manage or ameliorate a disorder a disorder associated with, characterized by or caused 
by premature translation termination and/or nonsense-mediated mRNA decay) or one or 

25 more symptoms thereof. 

As used herein, the temi "therapeutically effective amount" refers to that 
amount of a therapy {e.g., a therapeutic agent) sufficient to result in (i) the amelioration of 
one or more symptoms of a disorder a disorder associated with, characterized by or 
caused by premature translation termination and/or nonsense-mediated niElNA decay), (ii) 

30 prevent advancement of a disorder (e.g,, a disorder associated with, charact^zed by or 
caused by premature translation termmation and/or nonsense-mediated mRNA decay), (iii) 
cause regression of a disorder a disorder associated with, characterized by or caused 
by premature translation termination and/or nonsense-mediated mRNA decay), or (iv) to 
enhance or improve the therapeutic efrect(s) of another therapy (e.g., therapeutic agent). 
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5 As used herein, the terms *treat", 'treatment and 'treating" refer to the 

reduction or amelioration of the progression, severity and/or duration of a disorder (e.g., a 
disorder associated with, characterized by or caused by premature translation termination 
and/or nonsense-mediated mRNA decay) or one or more symptoms thereof resulting jfrom 
the administration of one or more compounds identified in accordance the methods of the 

1 0 invention, or a combination of one or more compounds identified in accordance with the 
invention and another therapy. 

As used herein, the terms "therap/' and 'therapies" refer to any method, 
protocol and/or agent that can be used in the prevention, treatment, management or 
amelioration of a disesase or disorder or one or more symptoms thereof. In certain 

15 embodiments, such terms refer to chemotherapy, radiation therapy, surgery, siqpportive 
therapy and/or other therapies usefiil in the prevention, treatment, management or 
amelioration of a disease or disorder or one or more symptoms thereof known to skilled 
medical personnel. 

4. DESCRIPTION OF DRAWINGS 
20 FIG. 1. The human 288 rRNA. Domains U and V are circled. 

FIG. 2. Gel retardation analysis to detect peptide-RNA interactions. In 20 fil 
reactions containing 50 pmole end-labeled TAR RNA oligonucleotide, increasing 
concentrations of Tat47.58 peptide (0.1 uM, 0.2 uM, 0.4 uM, 0.8 uM, 1.6 uM) was added in 
TK buflfer. The reaction mixture was then heated at 90°C for 2 min and allowed to cool 
25 slowly to 24°C. 1 0 pi of 30% glycerol was added to each sample and applied to a 12% 
non-denaturing polyacrylamide gel. The gel was electrophoresed using 1200 volt-hours at 
4**C in TBE Buflfer. Following electrophoresis, the gel was dried and the radioactivity was 
quantitated with a phosphorimager. The concentration of peptide added is indicated above 
each lane. 

30 FIG. 3. Gentamicin interacts with an oligonucleotide corresponding to the 

16S rRNA 20 \i\ reactions containing increasing concentrations of gentamicin (1 ng/ml, 10 
ng/ml, 100 ng/ml, 1 jig/ml, 10 |ig /ml, 50 pg /ml, 500 p.g /ml) were added to 50 pmole 
end-labeled RNA oligonucleotide in TKM buffer, heated at 90°C for 2 min and allowed to 
cool slowly to 24°C. 10 \il of 30% glycerol was added to each sample and the samples 

35 were applied to a 13.5% non-denaturing polyacrylamide gel. The gel was electrophoresed 
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5 using 1200 volt-hours at 4°C in TBE Buffer. Following electrophoresis, the gel was dried 
and the radioactivity was quantitated using a phosphorimager. The concentration of 
gentamicin added is indicated above each lane. 

FIG. 4. The presence of 10 pg/ml gentamicin produces a gel mobility shift in 
the presence of the 16S rRNA oligonucleotide. 20 |a.l reactions containing increasing 

10 concentrations of gentamicin (100 ng/ml, 10 ng/ml, 1 ng/ml, 100 pg/ml, and 10 pg/ml) were 
added to 50 pmole end-labeled RNA oUgonucleotide in TKM buffer were treated as 
described for Figure 3. 

FIG. 5. Gentamicin binding to the 16S rKNA oHgonucleotide is weak in the 
absence of MgCb. Reaction mixtures containing gentamicin (1 jig /ml, 100 \ig Ivd^ 10 \ig 

15 /ml,l fig /ml, 0.1 fig /ml, and 10 ng/ml) were treated as described in Figure 3 except that the 
TKM buffer does not contain MgCk. 

FIG. 6. Gel retardation analysis to detect peptide-RNA interactions. In 
reactions containing increasing concentrations of Tat47-58 peptide (O.l mM, 0.2 mM, 0.4 
mM, 0.8 mM, 1 .6 mM) 50 pmole TAR RNA oMgonucleotide was added in TK buffer. The 

20 reaction mixture was then heated at 90**C for 2 min and allowed to cool slowly to 24^C. 
The reactions were loaded onto a SCE9610 automated capillary electrophoresis apparatus 
(SpectruMedix; State College, Pennsylvania). The peaks correspond to the amount of free 
TAR RNA ("TAR") or the Tat-TAR complex ("Tat-TAR"). The concentration of peptide 
added is indicated below each lane. 

25 FIG. 7. Small molecules involved in nonsense suppression alter the chemical 

footprinting pattem in Domain V of the 288 rRNA. 1 00 pmol of ribosomes were incubated 
with 100 juM compound, followed by treatment with the chemical modifying agents 
kettioxal (KE) and dimethyl sulfate (DMS, not shown). Following chenaical modification, 
rRNA was prepared and analyzed in primer extension reactions using end-labeled 

30 oUgonucleotides hybridizing to rRNA A sequencing reaction was run in parallel as a 
marker. 

FIG. 8. Small molecules involved in nonsense suppression alter the chemical 
footprinting pattem in Domain V of the 28S rRNA. 100 pmol of ribosomes were incubated 
with 100 juM compound, followed by treatment with the chemical modifying agents 
35 kethoxal (KE) and dimethyl sulfate (DMS, not shown). Following chemical modification, 

26 



wo 2004/010106 PCT/US2003/023075 



5 rKNA was prepared and analyzed in primer Kctension reactions using end-labeled 
oligonucleotides hybridizing to rRNA. A sequencing reaction was run in parallel as a 
marker. 

FIG. 9- Small molecules involved in nonsense suppression alter the 
chemical footprinting pattern in Domain n (GTPase Center) of the 28S rRNA. 1 00 pmol of 

1 0 ribosomes were incubated with 1 00 fM compound, followed by treatment with the 
chemical modifying agents kethoxal (KE) and dimethyl sulfate (DMS, not shown). 
Following chemical modification, rRNA was prepared and analyzed in primer extension 
reactions using end-labeled oUgonucleotides hybridizing to rRNA. A sequencing reaction 
was run in parallel as a marker. 

15 FIG. 10. Small molecules involved in nonsense suppression alter the 

chemical foo^rinting pattern of domain n of the 28S rRNA. 100 pmol of ribosomes were 
incubated with 100 peM compound, followed by treatment with chemical modifying agents 
dimethyl sulfate (DMS) and ketfioxal (KE). Following chemical modilBcation, rRNA was 
prepared and analyzed in primer extension reactions using end-labeled oligonucleotides 

20 hybridizing to rRNA A sequencing reaction was run in parallel as a marker. 

FIG. 1 1. A specific region of Domain II can compete for compound binding 
and prevents nonsense suppression in vitro. The in vitro nonsense suppression assay was 
performed using a luciferase construct with a UGA nonsense mutation. 0.1 mM compound 
was present in the reaction to induce nonsense suppression. Competitor RNA 

25 corresponding to Domain n was added at the indicated concentrations (0, 1, 2.5, 5, 7.5, 10 
pM) to titrate the small molecule and prevent nonsense suppression. 

5. DETAILED DESCRIPTION OF THE INVENTION 
The present invention provides methods for identifying compounds that 
modulate translation termination and/or nonsense-mediated mRNA decay by identifying 
30 compounds that bind to preselected target elements of nucleic acids including, but not 
limited to, specific RNA sequences, RNA structural motifs, and/or RNA structural 
elements. In particular, the present invention provides methods of identifying compounds 
that bmd to regions of the 28S rRNA and analogs thereof. The specific target RNA 
sequences, RNA structural motife, and/or RNA structural elements (/.e., regions of the 28S 
35 rRNA and analogs thereof) are used as targets for screemng small molecules and 
identifying those that directly bind these specific sequences, motifs, and/or structural 
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5 elements. For example, methods are described in which a preselected target RNA having a 
detectable label or method of detection is used to screen a library of compounds, preferably 
under physiologic conditions; and any complexes formed between the target RNA and a 
member of the hbrary are identified using physical methods that detect the labeled or 
altered physical property of the target RNA bound to a compound. Further, methods are 

10 described in which a preselected target KNA is used to screen a library of compounds, with 
each compound in the library having a detectable label or method of detection, preferably 
under physiologic conditions; and any complexes formed between the target KNA and a 
member of the library are identified using physical metibiods that detect the labeled or 
altered physical property of the compound bound to target RNA. 

1 5 The present invention provides methods for identifying compounds that bind 

to a target RNA (e.g., regions or fragments of 28S rRNA or RNA containing a premature 
stop codon), said methods comprising contacting a target RNA having a detectable label 
with a library of compounds free in solution, and detecting the formation of a target 
RNA:compound complex. In particular, the present invention provides methods for 

20 identifying compounds that bind to a target RNA (e.g., regions or fragments of 28S rRNA 
or RNA containing a premature stop codon), said methods comprising contacting a target 
RNA having a detectable label with a Ubrary of compounds free in solution, in, e.g.y 
labeled tubes or a microtiter plate, and detecting the formation of a target RNAxompound 
complex. Compounds in the library that bind to the labeled target RNA will form a 

25 detectably labeled complex. The detectably labled complex can then be identified and 

removed from the uncomplexed, unlabeled complex, and from uncomplexed, labeled target 
RNA, by a variety of methods, including, but not limited to, methods that differentiate 
changes in the electrophoretic, chromatographic, or thermostable properties of the 
complexed target RNA. Such methods include, but are not limited to, electrophoresis, 

30 fluorescence spectroscopy, surfrtce plasmon resonance, mass spectrometry, scintillation 
proximity assay, structure-activity relationships ("SARS") by NMR spectroscopy, size 
exclusion chromatography, aflBboity chromatogrjqphy, and nanoparticle aggregation. 

The present invention provides methods for identifying compounds that bind 
to a target RNA (eg., regions or fragments of 28S rRNA or RNA containing a premature 

35 stop codon), said methods comprising contacting a target RNA with a library of 

compounds, wherein each compound in the Ubrary is detectably labeled, and detecting the 
formation of a target RNA:compound complex. In particular, the present invention 
provides methods for identifydng compounds that bind to a target RNA {e,g,, regions or 
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5 fiagments of 28S rRNA, or KNA containing a premature stop codon), said methods 
comprising contacting a target RNA with a library of compounds free in solution, in, e.g., 
labeled tubes or a microtiter plate, wherein each compound in the Ubraiy is detectably 
labeled, and detecting the formation of a target RNA:conq)ound complex. Compounds in 
the library that bind to the labeled target RNA will fonn a detectably labeled complex. The 
10 detectably labled complex can then be identified and removed from the uncomplexed, 
unlabeled complex, and from uncomplexed, target KNA, by a variety of methods, 
including, but not limited to, methods lhat differentiate changes in the electrophoretic, 
chromatographic, or thennostable properties of the complexed target RNA Such methods 
inchide, but are not limited to, electrophoresis, fluorescence qiectioscqpy, surface plasmon 
15 resonance, mass spectrometry, scintillation proximity assay, structure-activity relationships 
("SARS") by NMR spectroscopy, size exclusion chromatography, afiBnity chromatography, 
and nanoparticle aggregation. 

The present invention provides methods for identifying compounds that bind 
to a target RNA {e.g., regions or fragments of 28S iRNA or RNA containing a premature 
JO stop codon), said methods comprising contacting a target RNA having a detectable label 
wilh a Ubrary of compounds bound, wherein each compound in the library is attached to a 
soKd support; and detecting the formation of a target RNArcompound complex. In 
particular, the present invention provides methods for identifying compounds that bind to a 
target RNA (in particular, regions of 28S rRNA or RNA contaimng a premature stop 
t5 codon). said method comprising contacting a target RNA having a detectable label with a 
library of compounds, wherein each conq)ound is attached to a solid support {e.g., a bead- 
based horary of compounds or a microarray of conq)ounds), and detecting the formation of 
a target RNA:conq)ound complex. Compounds in the Ubrary that bind to the labeled target 
RNA will form a detectably labeled complex. Conq>ounds in the Kbiaiy that bind to the 
0 labeled target RNA will form a soUd support (e.g., a bead-based) detectably labeled 

conq)lex, which can be sq)arated from the unbound beads and unbound target RNA in the 
Kquid phase by a number of physical means, including, but not limited to, flow cytometiy, 
afBnity chromatography, manual batch mode separation, suspension of beads in electric 
fields, and microwave of the bead-based detectably labeled complex. 
5 The present invaition provides methods for identifying compounds that bind 

to a target RNA {e.g., regions of 28S rRNA or RNA containing a premature stop codon), 
said methods comprising contacting a target RNA attached or conjugated to a soKd support 
with a Kbrary of compounds, wherem each compound in the library is detectably labeled, 
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5 and detecting the formation of a target RNAicompound complex. Target KNA molecules 
that bind to labeled compounds will form a detectable labeled complex. Target RNA 
molecules that bind to labeled compounds will form a solid support-based detectably 
labeled complex, which can be separated firom the unbound solid support-target SNA and 
unbound compounds in the liquid phase by a number of means, including, but not limited 

10 to, flow cytometry, affinity chromatography, manual batch mode separation, suspension of 
beads in electric fields, and microwave of the bead-based detectably labeled complex. 

Thus, the methods of the present invention provide a simple, sensitive assay 
for high-tibroughput screening of libraries of compounds, in which tiie compounds of the 
library that specifically bind a preselected tairget nucleic acid are easily distinguished firom 

15 non-binding members of the library. In one embodiment, the structures of the binding 
molecules are deciphered fi'om the input library by methods depending on the type of 
library that is used. In another embodiment, die structures of the binding molecules are 
ascertained by de novo structure determination of the con5)ounds using, for example, mass 
spectrometry or nuclear magnetic resonance C*NMR"). The compounds so identified are 

20 usefiil for any purpose to which a binding reaction may be put, for example in assay 

methods, diagnostic procedures, cell sorting, as inhibitors of target molecule fimction, as 
probes, as sequestering agents and lead compounds for development of therapeutics, and the 
like. Small organic conq)ounds that are identified to interact specifically with the target 
RNA molecules are particularly attractive candidates as lead compounds for the 

25 development of th^apeutic agents. 

The assaj^ of the invention reduce bias introduced by competitive binding 
assays which require the identification and use of a host cell factor (presumably essential 
for modulating RNA fimction) as a binding partner for the target RNA. The assays of the 
present invention are designed to detect any compound or agent that binds to 28S rRNA, 

30 preferably under physiologic conditions. Such agents can tiien be tested for biological 
activity, without estabhshing or guessmg which host cell factor or factors is required for 
modulating the fimction and/or activity of 28S rRNA. 

S.l 28S rRNA and Analogs Thereof 

The ribosome is a 2.5-MDa ribonucleoprotein complex involved in the 
35 decoding of genetic material firom mRNA to proteins. A combination of biophysical and 
biochemical analysis have provided three dimensional models of the ribosome as well as 
detailed analyses into the mechanism of the. individual steps in translation (see, Green 
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5 & NoUer, 1997, Amu. Rev. Biochem. 66:679-716; Gate et al, 1999, Science 
285(5436):2095-2104; and Ban et al, 2000, Science.289(5481): 905-920.). 

The 28S rKNA is one of the ribosomal RNA components of the 60S subunit 
of eukaryotic ribosomes. The 28S rRNA sequences are conserved when expressed as 
mature rRNAs, although the 28S rRNA contains variable sequence tracts that are 

10 interspersed among conserved core sequences and lacking in the counterpart bacterial 23S 
rRNA (see, e.g., Hancock & Dover, 1988, Mol. Biol. EvoL 5:377-391). A diagram of the 
28S rRNA is presented in Figure 1, with domains U and V circled. As indicated in Figure 
1, a GTPase center has been mapped to domain n and the peptidyl transferase center has 
been mapped to domain V. 

15 Compounds that interact in these regions or modulate local changes within 

these domains of the ribosome alter base pairing interactions, base modification or 
modulate binding of trans-acting factors that bind to these regions) have the potential to 
modulate translation termination. These regions, z.c., domains n and V are conserved fi:om 
prokaryotes to eukaryotes, but the role of these regions in modulatmg translation 

20 termination has not been realized in eukaryotes. In bacteria, when a short RNA fragment, 
complementary to the E. coli 23S rRNA segment comprising nucleotides 735 to 766 (in 
domain U), is expressed in vivo, suppression of UGA nonsense mutations, but not UAA or 
UAG, results (Chemyaeva et al, 1999, J Bacterid 181 :5257-5262). Other regions of the 
23 S rRNA in-E. coli have been imphcated in nonsense suppression including the GTPase 

25 center in domain II (nt 1034-1 120; Jemiolo et al, 1995, Proc. Nat. Acad. Sci. 
92:12309-12313). 

Genetic mutations in bact^a have also identified rRNA mutations that 
either inCTease the level of firameshifting in the trpE or the suppression of a nonsense 
mutations in the trpA gene (reviewed in Green & Noller, 1997, Annu. Rev. Biochem. 

30 66:679-71 6). The firameshifting mutations mapped to domains IV and V of the 23S rRNA. 
Disruption of the interaction of the CCA end of the tRNA with the peptidyl transferase 
center of the ribosome has been demonstrated to result in an increased translational error 
firequency (reviewed in Green & Noller, 1997, Araiu. Rev. Biochem. 66:679-716). 

Regions of the 28S rRNA involved in fiameshifling, nonsense mutation 

35 suppression, GTPase activity, or peptidyl transferase are attractive target RNAs to identify 
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compounds that modulate premature translation temiination and/or nonsense mediated 
decay. The interference of a compound with one or more of these functions could 
potentially mediate translation termination by interfering with preatnature translation 
termination. Without being bound by theory, a compound could potentially mediate 
translation termination by causing read through of a premature translation codon, therefore 
allowing flie synthesis of the full-laigth protein. 

hi a preferred embodiment, the target RNA comprises a region of 28S rRNA 
corresponding to domain H (see, e.g., nucleotides 1310 to 2333 of accession number 
M11167) or domain V of 28S rRNA (see, e.g., nucleotides 3859 to 4425 of accession 
number Ml 1167) or an analog thereof. It will become apparent to one of skill in the art that 
an analog of the 28S rRNA has an analogous structure and function to native 28S rRNA. 
For example, an analog of human 28S rRNA includes, but is not limited to, a human 28S 
rRNA retropseudogene (see, e.g., Wang et al., 1997, Gene 196: 105-1 1 1, Accession Number 
120636). Regions corresponding to domain H or domain V of tiie 28S rRNA pseudogene 
could be used as target RNAs in the present invention, hi a prefeired embodiment, the 28S 
rRNA is a human 28S rRNA, although the teachings of the present invention are applicable 
to mammals. 

Synthesis of the target RNAs, i.e., regions of 28S rRNA, can be performed 
by methods known to one of skill in the art (see, e.g., Sambrook et al, 1989, Molecular 
Cloning, A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, New York and Glover, D.M. (ed.). 1985, DNA Cloning: A Practical Approach, 
MRL Press, Ltd., Oxford, U.K. Vol. I, H). In a preferred embodiment, the target RNAs are 
cloned as DNAs downstream of a promoter, such as but not Umited to T7, T3, or Sp6 
promoters, and mi vitro transcribed with the corresponding polymerase. A detectable label 
can be incorporated into the in vitro transcribed RNA or alternatively, the target RNA is 
end-labeled (see Section 5.3 infra). Alternatively, the target RNA can be amplified by 
polymerase chain reaction with a primer containing an RNA promoter and subsequently in 
vitro tianscribed, as described in U.S. Patent No. 6,271, 002, which is incorporated by 
reference in its entirety. 

5.2 Stop Codon Containin|F Target RNA 
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The present invention provides for methods for screening and identifying 
compounds that modulate premature translation termination and/or nonsense-mediated 
mRNA decay. A target RNA may be engineered to contain a premature stop codon or, 
alternatively, a target RNA may naturally contain a premature stop codon. The premature 
stop codon may any one of the stop codons known in the art including UAG, UAA and 
VGA. 

The stop codons are UAG, UAA, and UGA, i.e., signals to the ribosome to 
terminate protein synthesis, presumably through protein release fectois. Even thou^ the 
use of these stop codons is widespread, they are not universal. For example, UGA q)ecifies 
tryptophan in the mitochondria of mammals, yeast, Neuroqjora crassa, Drosophila, 
protozoa, and plants (see, e.g., Breitenberger & RajBhandary, 1985, Trends Biochem Sci 
10;481). Other examples include the use of UGA for tryptophan in Mycoplasma and, in 
ciliated protozoa, the use of UAA and UAG for glutamme (see, e.g.. Jukes et al., 1987, Cold 
Spring Haib Symp Quant Biol. 52:769-776), the use of UGA for cysteme in Ihe dliate 
Euplotes aediculatus (see, e.g., Kervestin et al., 2001, EMBO Rep 2001 Aug;2(8):680-684), 
the use of UGA for tryptophan in Blepharisma americanum and the use of UAR for 
glutamine in Tetrahymena, and three spirotrichs, Stylonychia lemnae, S. mytilus, and 
Oxytricha tiifaUax (see, e.g., Lozupone et al., 2001, Curr Biol 1 1(2):65-74). It has been 
proposed that the ancestral mitochondrion was bearing the universal genetic code and 
subsequently reassigned the UGA codon to tryptophan independently, at least in the lineage 
of ciliates, kinetoplastids, rhodophytes, prymnesiophytes, and fungi (see, e.g., Inagaki et al., 
1998, J Mol Evol 47(4):378-384). 

The readthrough of stop codons also occurs in positive-sense ssRNA viruses 
by a variety of naturaUy occurring suppressor tRNAs. Such naturally-occurring suppressor 
tRNAs include, but are not limited to, cytoplasmic tRNATyr, which reads through the UAG 
stop codon; cytoplasmic tRNAsGhi, which read through UAG and UAA; cytoplasmic 
tRNAsLeu, which read through UAG; chloroplast and cytoplasmic tRNAsTip, which read 
through UGA; chloroplast and cytoplasmic fRNAsCys, which read through UGA; 
cytoplasmic tRNAsArg, which read throu^ UGA (see, e.g., Beier & Grimm, 2001, Nucl 
Acids Res 29(23):4767-4782 for a review); and the use of selenocysteine to suppress UGA 
in E. coli (see, e.g.. Baron & BQck, 1995, The selenocysteme inserting tRNA q)ecies: 
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Structure and function. In SOllJ). and RajBhandary.UX. (eds), tRNA: Structure, 
Biosynthesis and Function, ASM Press, Washington, DC, pp. 529 544). The mechanism ii 
thought to involve unconventional base interactions and/or codon context effects. 

As described above, the stop codons are not necessarily universal, with 
considerable variation amongst organelles (e.g., mitochondria and chloroplasts), viruses 
(e.g., single strand viruses), and protozoa (e.g., ciliated protozoa) as to whether the codons 
UAG, UAA, and UGA signal translation tennination or encode amino acids. Even though 
single release factor most probably recognizes all of the stop codons in eukaryotes, it 
appears that all of the stop codons are not siq>pressed in a similar matter. For example, in 
the yeast Saccharomyces pombe, nonsense suppression has to be strictly codon specific 
(see, Hottinger et al., 1984, EMBO J 3:423-428). In another example, significant 
differaices were found in the degree of suppression amongst three UAG codons and two 
UAA codons in different njRNA contexts in Escherichia coli and in human 293 cells, 
although data suggested that the context effects of nonsaise si^ression operated 
differently in coli and human cells (see, e.g., Martin et aL, 1989, Mol Gen Genet 217(2 
3):41 1 8). Since unconventional base interactions and/or codon context effects have been 
in5)Ucated in nonsense suppression, it is conceivable that compounds involved in nonsense 
siqjpression of one stop codon may not necessarily be involved in nonsense suppression of 
another stop codon. Li other words, compounds involved in suppressing UAG codons may 
not necessarily be involved in suppressing UGA codons. 

In a specific embodiment, a target KNA contains or is engineered to contain 
the premature stop codon UAG. In another embodiment, a target RNA contains or is 
engineered to contain the premature stop codon UGA 

In a particular embodiment a target RNA contains or is engineered to 
contain two or more stop codons. In accordance with this embodiment, the stop codons are 
preferably at least 10 nucleotides, at least 15 nucleotides, at least 20 nucleotides, at least 25 
nucleotides, at least 30 nucleotides, at least 35 nucleotides, at least 40 nucleotides, at least 
45 nucleotides, at least 50 nucleotides, at least 75 nucleotides or at least 100 nucleotides 
apart fiiom each other. Further, in accordance with this embodiment, at least one of the stop 
codons is preferably UAG or UGA. 
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5 In a specific embodim^t, a target RNA contains or is engineered to contain 

a premature stop codon at least 15 nucleotides, preferably at least 20 nucleotides, at least 25 
nucleotides, at least 30 nucleotides, at least 35 nucleotides, at least 40 nucleotides, at least 
45 nucleotides, at least 50 nucleotides or at least 75 nucleotides from the start codon in the 
coding sequence. In another embodiment, a target RNA contains or is engineered to 

10 contain a premature stop codon at least 15 nucleotides, preferably at least 25 nucleotides, at 
least 50 nucleotides, at least 75 nucleotides, at least 100 nucleotides, at least 125 
nucleotides, at least 150, at least 175 nucleotides or at least 200 nucleotides firom the native 
stop codon in the coding sequence of the full-length protein, polypeptide or peptide. In 
another embodiment, a target RNA contains or is engineered to contain a premature stop 

15 codon at least 15 nucleotides (preferably at least 20 nucleotides, at least 25 nucleotides, at 
least 30 nucleotides, at least 35 nucleotides, at least 40 nucleotides, at least 45 nucleotides, 
at least 50 nucleotides or at least 75 nucleotides) from the start codon in the coding 
sequence and at least 15 nucleotides (preferably at least 25 nucleotides, at least 50 
nucleotides, at least 75 nucleotides, at least 100 nucleotides, at least 125 nucleotides, at least 

20 150, at least 175 nucleotides or at least 200 nucleotides) from the native stop codon in the 
coding sequence of the full-length protein, polypeptide or peptide. In accordance with these 
embodiments, the premature stop codon is preferably UAG or UGA 

The premature translation stop codon can be produced by in vitro mutagenesis 
techniques such as, but not limited to, polymerase chain reaction ("PGR*'), linker insertion, 

25 oligonucleotide-mediated mutagenesis, and random chanical mutagenesis. 

5.3 Target RNAs a)etectablv Labeled or Attached to a Solid Support) 

Target nucleic acids, including but not limited to RNA and DNA, useful in 
the methods of the present invention have a label that is detectable via conventional 
spectroscopic means or radiographic means. Preferably, target nucleic acids are labeled 
30 with a covalently attached dye molecule. Useful dye-molecule labels include, but are not 
limited to, fluorescent dyes, phosphorescent dyes, ultraviolet dyes, infrared dyes, and 
visible dyes. Preferably, the dye is a visible dye. 

Useful labels in the present invention can include, but are not limited to, 
spectroscopic labels such as fluorescent dyes (e.g., fluorescein and derivatives such as 
35 fluorescein isothiocyanate (FITC) and Oregon Green™, rhodamine and derivatives (e.g., 

35 
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5 Texas red, tetramethykhodimine isothiocynate (TRTTC), bora-3a,4a-diaza-s-indacene 
(BODIPY®) and derivatives, etc.), digoxigenin, biotin, phycoerythrin, AMCA, CyDye™, 
and the like), radiolabels (e.g., ^H, ^^I, ^^S, ^"^C, ^^P, etc.), enzymes (e.g., horse radish 
peroxidase, alkaline phosphatase etc.), spectroscopic colorimetric labels such as colloidal 
gold or colored glass or plastic (e.g. polystyrene, polypropylene, latex, etc.) beads, or 

10 nanoparticles - nanoclusters of inorganic ions with defined dimension firom 0.1 to 1000 run. 
Useful affinity tags and complimentary partners include, but are not limited to, 
biotin-streptavidin, complimentary nucleic acid Augments (e.g., oligo dT-oligo dA, oligo 
T-oligo A, oligo dG-oligo dC, oligo G-oligo C), aptamer-streptavidin, or h^tens and 
proteins for which antisera or monoclonal antibodies are available. The label may be 

15 coupled direcfly or indirectly to a component of the detection assay ie.g., the detection 

reagent) according to methods well known in the art A wide variety of labels may be used, 
with the choice of label depending on sensitivity required, ease of conjugation with the 
compound, stability requirements, available instrumentation, and disposal provisions. 

In one embodiment, nucleic acids that are labeled at one or more specific 

20 locations are chemically synthesized using phosphoramidite or other solution or solid-phase 
methods. Detailed descriptions of the chemistry used to form polynucleotides by the 
phosphoramidite method are well known (see, e.g., Caruthers et al, U.S. Pat. Nos. 
4,458,066 and 4,415,732; Carutiiers et al, 1982, Genetic Engineering 4:1-17; Users Manual 
Model 392 and 394 Polynucleotide Synthesizers, 1990, pages 6-1 through 6-22, Applied 

25 Biosystems, Part No. 901237; Ojwang, et al, 1997, Biochemistry, 36:6033-6045). The 
phosphoramidite method of polynucleotide synthesis is the preferred method because of its 
efficient and rapid coupling and the stability of the starting materials. The synthesis is 
performed with the growing polynucleotide chain attached to a soUd support, such that 
excess reagents, which are generally in the liquid phase, can be easily removed by washing, 

30 decanting, and/or filtration, thereby eliminating the need for purification steps between 
synthesis cycles. 

The following briefly describes illustrative steps of a typical polynucleotide 
synthesis cycle using the phosphoramidite method. First, a solid support to which is 
attached a protected nucleoside monomer at its 3* terminus is treated with acid, e.g., 
35 trichloroacetic acid, to remove the 5'-hydroxyl protecting group, freeing the hydroxyl group 
for a subsequent coupling reaction. After the coupling reaction is completed an activated 

36 



wo 2004/010106 PCTAJS2003/023075 



5 intennediate is formed by contacting the support-bound nucleoside with a protected 
nucleoside phosphoramidite monomer and a weak acid, e.g., tetrazole. The weak acid 
protonates the nitrogen atom of the phosphoramidite forming a reactive intermediate. 
Nucleoside addition is generally complete within 30 seconds. Next, a capping step is 
performed, which terminates any polynucleotide chains that did not undergo nucleoside 

10 addition. Capping is preferably performed using acetic anhydride and 1-methylimidazole. 
The phosphite group of the intemucleotide linkage is then converted to the more stable 
phosphotriester by oxidation using iodine as the preferred oxidizing agent and water as ttie 
oxygen donor. After oxidation, the hydroxyl protecting group of the newly added 
nucleoside is removed with a protic acid, e,g., trichloroacetic acid or dichloroacetic acid, 

15 and the cycle is repeated one or more times imtil chain elongation is complete. After 
synthesis, the polynucleotide chain is cleaved fiom the support using a base, e.g., 
ammonium hydroxide or t-butyl amine. The cleavage reaction also removes any phosphate 
protecting groups, e.g„ cyanoethyl. Finally, the protecting groups on the exocyclic amines 
of the bases and any protecting groups on the dyes are removed by treating the 

20 polynucleotide solution in base at an elevated temperature, e.g., at about 55**C. Preferably 
the various protecting groups are removed usmg anmionium hydroxide or t-butyl amine. 

Any of the nucleoside phosphoramidite monomers can be labeled using 
standard phosphoramidite chemistry methods (Hwang et al, 1999, Proc. Natl. Acad. Sci. 
USA 96(23):12997-13002; Ojwang et al, 1997, Biochemistry. 36:6033-6045 and 

25 references cited therein). Dye molecules useftil for covalently coupling to 

phosphoramidites preferably comprise a primary hydroxyl group that is not part of the dye*s 
chromophore. Illustrative dye molecules include, but are not limited to, disperse dye CAS 
4439-31-0, disperse dye CAS 6054-58-6, disperse dye CAS 4392-69-2 (Sigma-Aldrich, St 
Louis, MO), disperse red, and l-pyroiebutanol (Molecular Probes, Eugene, OR). Other 

30 dyes useftil for coupling to phosphoramidites will be apparent to those of skiU in the art, 
such as fluoroscein, cy3, and cy5 fluorescent dyes, and may be purchased from, e.g.^ 
Sigma-Aldrich, St. Louis, MO or Molecular Probes, Inc., Eugene, OR. 

In another embodiment, dye-labeled target molecules are synthesized 
enzymatically using in vitro transcription (Hwang et al, 1999, Proc. Natt. Acad. Sci. USA 

35 96(23): 12997-13002 and references cited therein). In this embodiment, a mixture of 
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5 ribonucleoside-5'-triphosphates C2?)able of supporting template-directed enzymatic 

extension (eg*., a mixture including GTP, ATP, CTP, and UTP, including one or more dye- 
labeled ribonucleotides; Sigma- Aldrich, St. Louis, MO) is added to a promoter-containing 
DNA template. Next, a polymerase enzyme is added to the mixture under conditions where 
the polymerase enzyme is active, which are well-known to those skilled in the art. A 

10 labeled polynucleotide is formed by the incorporation of the labeled ribonucleotides during 
polymerase-mediated strand synthesis. 

In yet another embodiment of the invention, nucleic acid molecules are end- 
labeled after their synthesis. Methods for labeling the 5*-end of an oligonucleotide include 
but are by no means limited to: (i) periodate oxidation of a 5'-to-5'-coupled ribonucleotide, 

15 followed by reaction with an amine-reactive label (Heller & Morisson, 1985, in Rapid 
Detection and Identification of Infectious Agents, D.T. Kingsbury and S. Falkow, eds., pp. 
245-256, Academic Press); (ii) condensation of ethylenediamine with 5'-phosphorylated 
polynucleotide, followed by reaction with an amine reactive label (Morrison, European 
Patent Application 232 967); (iii) introduction of an aliphatic amine substituent using an 

20 aminohexyl phosphite reagent in solid-phase DNA synthesis, followed by reaction with an 
amine reactive label (Cardullo et al, 1988, Proc. Natl. Acad. Sci. USA 85:8790-8794); and 
(iv) introduction of a fhiophosphate group on the 5 '-end of the nucleic acid, using 
phosphatase treatment followed by end-labeling with ATP-7S and kinase, which reacts 
specifically and efficiently with maleimide-labeled fluorescent dyes (Czworkowski et al^ 

25 1991, Biochem. 30:4821-4830). 

A detectable label should not be incorporated into a target nucleic acid at the 
specific binding site at which compounds are likely to bind, since the presence of a 
covalently attached label might interfere sterically or chemically with the binding of tiie 
compounds at this site. Accordingly, if the region of the target nucleic acid that binds to a 

30 host cell factor is known, a detectable label is preferably incorporated into the nucleic acid 
molecule at one or more positions that are spatially or sequentially remote firom the binding 
region. 

After synthesis, the labeled target nucleic acid can be purified using standard 
techniques known to those skilled in the art {see Hwang et al^ 1999, Proc. Natl. Acad. Sci. 
35 USA 96(23): 12997-13002 and references cited therein). Depending on the length of the 
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S target nucleic acid and the method of its synthesis, such purification techniques include, but 
are not limited to, reverse-phase high-performance liquid chromatography C*reverse-phase 
HPLC")> fast performance liquid chromatography ('TPLC"), and gel purification. After 
purification, the target RNA is refolded into its native conformation, preferably by heating 
to approximately 85-95**C and slowly cooling to room temperature in a buffer, e.g., a buffer 

10 comprising about 50 mM Tris-HCl, pH 8 and 100 mM NaCl. 

In another embodiment, the target nucleic acid can also be radiolabeled. A 
radiolabel, such as, but not hmited to, an isotope of phosphorus, sulfiir, or hydrogen, may 
be incorporated into a nucleotide, which is added either after or during the synthesis of the 
target nucleic acid. Methods for the synthesis and purification of radiolabeled nucleic acids 

15 are well known to one of skill in the art. See, e.g., Sambrook et al , 1 989, in Molecular 
Cloning: A Laboratory Manual, pp 10.2-10.70, Cold Spring Harbor Laboratory Press, and 
the refCTences cited therein, which are hereby incorporated by reference in their entireties. 

In another embodiment, the target nucleic acid can be attached to an 
inorganic nanoparticle. A nanoparticle is a cluster of ions wilh controlled size firom 0.1 to 

20 1000 nm comprised of metals, metal oxides, or semiconductors including, but not linaited to 
Ag2S, ZnS, CdS, CdTe, Au, or Ti02. Nanoparticles have imique optical, electronic and 
catalytic properties relative to bulk materials which can be adjusted according to the size of 
the particle. Methods for the attachment of nucleic acids are well known to one of skill in 
the art (see, Niemeyer, 2001, Aagew. Chem. hit. Ed. 40: 4129-4158, International 

25 Patent Publication WO/021 8643, and the references cited therein, the disclosures of which 
are hereby incorporated by reference id their entireties). 

In yet another embodiment of the invention, target nucleic acids can be 
attached or conjugated to a solid support for use in the assays of the invention. There are a 
number of methods, known in the art, that can be used to immobilize nucleic acids on a 

30 soUd support. For exantiple, modified DNA has been covalentiy immobilized to a variety of 
surfaces using amino acids (see, e^g.. Running, J. A, and Urdea, M. S. (1990) 
Biotechniques, 8, 276-277), (Newton, C. R., et al., (1993) Nucl. Acids. Res., 21 1155- 
1162.), (Nikiforov, T. T., and Rogers, Y. H. (1995) Anal. Biochem., 227, 201-209). 
Alternatively, carboxyl groups, (Zhang, Y., et al., (1991) Nucl. Acids Res., 19, 3929-3933), 

35 epoxy groups (Lamture, J. B., et al., (1994) Nucl. Acids Res. 22, 2121-2125), (Eggers, M. 
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5 D., et al., (1994) BioTechniques, 17, 516-524) or amino groups (Rasmussen, S. R., et al., 
(1991) Anal. Biochem., 198, 138-142), can be used to attach nucleic acids to solid surfaces. 
Such embodiments would be usefid in, e.g., high throughput assays intended to screen a 
hbrary of compounds in order to identify molecules that bind to target nucleic acids that 
have been attached to a sohd support. Jn a particular embodiment, target RNA molecules 

10 are attached or conjugated to a sohd support, e.^., a sHde or a bead, using an appropriate 
molecule that does not interfere with its binding to compounds of the invention and then 
subsequently screened with a hbrary of compounds. Members of a hbrary of compounds 
are preferably detectably labeled so that compounds that bind to target RNAs can be 
identified. Suitable detectable labels that can be used to label compounds are known in the 

15 art and also described herein. In a more preferred embodiment, target RNA molecules are 
immobilized on a surface suitable for preforming microarray assays. Any technique known 
in the art can be used to immobilize nucleic acid molecules on a sohd support surface. The 
nucleic acid is preferably, for example, covalently attached to the sohd support. 

5*4 Libraries of Small Molecules 

20 Libraries screened using the methods of the present invention can comprise a 

variety of types of compounds. In one embodiment, the Ubraries screened using the 
methods of the present invention can comprise a variety of types of compounds on solid 
supports. In other embodiments described below, the Ubraries can be synthesized on sohd 
supports or the compounds of the hbrary can be attached to sohd supports by linkers. In 

25 some embodiments, the compounds are nucleic acid or peptide molecules. In a non- 
limiting example, peptide molecules can exist in a phage display library. In other 
embodiments, types of compounds include, but are not limited to, peptide analogs including 
peptides comprising non-naturally occurring amino acids, e.g., D-amino acids, phosphorous 
analogs of amino acids, such as a-amino phosphoric acids and a-amino phosphoric acids, or 

30 amino acids having non-peptide hnkages, nucleic acid analogs such as phosphorothioates 
and PNAs, hormones, antigens, synthetic or naturally occurring drugs, opiates, dopamine, 
serotonin, catecholamines, thrombin, acetylcholine, prostaglandins, organic molecules, 
pheromones, adenosine, sucrose, glucose, lactose and galactose. Libraries of polypeptides 
or proteins can also be used. 
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In a preferred embodiment, the combinatorial libraries are small organic 
molecule libraries, such as, but not limited to, benzodiazepines, isoprenoids, 
thiazolidinones, metathiazanones, pyrrolidines, morpholino compoimds, and 
diazepindiones. In another embodiment, the combinatorial libraries comprise peptoids; 
5 random bio-oligomers; benzodiazepines; diversomers such as hydantoins, benzodiazepines 
and dipeptides; vinylogous polypeptides; nonpeptidal peptidomimetics; oligocarbamates; 
peptidyl phosphonates; peptide nucleic acid libraries; antibody libraries; or carbohydrate 
libraries. Combinatorial libraries are themselves coromercially available (see, e.g., 
Advanced ChemTech Europe Ltd., Cambridgeshire, UK; ASINEX, Moscow Russia; 

10 BioFocus pic, Sittingboume, UK; Bionet Research (A division of Key Organics Limited ), 
Camelford, UK; ChemBridge Corporation, San Diego, California; ChemDiv Inc, San 
Diego, Califomia.; ChemRx Advanced Technologies, South San Francisco, California; 
ComGenex Inc., Budapest, Hungary; Evotec OAI Ltd, Abingdon, UK; IF LAB Ltd., Kiev, 
Ukraine; Maybridge pic, Cornwall, UK; PharmaCore, Inc., North Carolina; SIDDCO Inc, 
. 15 Tucson, Arizona; TimTec Inc, Newark, Delaware; Tripos Receptor Research Ltd, Bude, 
UK; Toslab, Ekaterinburg, Russia). In a specific embodiment, the combinatorial libaries 
are small molecules. 

In another embodiment, combinatorial Ubraries, useful in the present 
invention are combinatorial libraries of labeled compounds with each compound in the 

20 hbrary having a label that is detectable via conventional spectroscopic means or 

radiographic means. Preferably, compounds are labeled with a covalently attached and 
detectable isotope. Other useful labels in the present invention include, but are not limited 
to, fluorescent tags or dye molecules. Useful dye molecules, include, for example, 
fluorescent dyes, phosphorescent dyes, ultraviolet dyes, infi^ared dyes, and visible dyes. 

25 Useful fluorescent tags, include, for example, fluoresein and derivatives such as fluorescein 
isothiocyanate (FITC) and Oregon Green™, rhodamine and derivatives (e.g., Texas red, 
tetramethylrhodinaine isothiocynate (TRTTC), bora-3a,4a-diaza-s-indacene (BODEPY®) and 
derivatives, eta), digoxigenin, biotin, phycoeryflirin, AMCA, CyDye™, and the like), 
radiolabels (eg., ^H, ^^^I, ^^S, ^"^C, ^^P, ^^P, etc.), enzymes (e.g., horse radish peroxidase, 

30 alkaline phosphatase etc.), spectroscopic colorimetric labels such as colloidal gold or 

colored glass or plastic (e.g. polystyrene, polypropylene, latex, etc.) beads, or nanoparticles 
- nanoclusters of inorganic ions with defined dimension from 0.1 to 1000 nm. The label 
may be coupled directly or indirectly to a component of the detection assay (e.g., the 
detection reagent) according to methods well known in the art. A wide variety of labels 

35 may be used, with the choice of label depending on sensitivity required, ease of conjugation 
with the compound, stability requirements, available instrumentation, and disposal 
provisions. 
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In one embodiment, the combinatorial compound library for the methods of 
the present invention may be synthesized. There is a great interest in synthetic methods 
directed toward the creation of large collections of small organic compounds, or Ubraries, 
which could be screened for pharmacological, biological or other activity (DoUe, 2001, J. 
5 Comb, Chem. 3:477-517; Hall et al, 2001, J. Comb. Chem. 3:125-150; DoUe, 2000, J. 
Comb. Chem. 2:383-433; DoUe, 1999, J. Comb. Chem, 1:235-282). The synthetic methods 
applied to create vast combinatorial libraries are performed in solution or in the solid phase, 
Le,y on a soUd siq)port. Solid-phase synthesis makes it easier to conduct multi-step 
reactions and to drive reactions to completion with high yields because excess reagents can 

10 be easily added and washed away after each reaction step. Solid-phase combinatorial 
synthesis also tends to improve isolation, purification and screening. However, the more 
traditional solution phase chentiistry supports a wider variety of organic reactions than 
solid-phase chemistry. Methods and strategies for the synthesis of combinatorial libraries 
can be found in A Practical Guide to Combinatorial Chemistry^ A.W. Czarmk and S.H. 

15 Dewitt , eds,, American Chemical Society, 1997; The Combinatorial Index, B.A. Bimin, 
Academic Press, 1998; Organic Synthesis on Solid Phase, F.Z. Dorwald, Wiley- VCH, 
2000; znA Solid-Phase Organic Syntheses, Vol i, A,W. Czarmk, ed., Wiley Interscience, 
200L 

Combinatorial compound libraries of the present invention may be 

20 synthesized using apparatuses described in US Patent No. 6,358,479 to Frisina et al, U.S. 
Patent No. 6,190,619 to Kilcoin et al, US Patent No. 6,132,686 to Gallup et al, US Patent 
No. 6,126,904 to Zuellig et al, US Patent No. 6,074,613 to Harness et al, US Patent No, 
6,054,100 to Stanchfield et al, and US Patent No. 5,746,982 to Saneii et al which are 
hereby incorporated by reference in their entirety. These patents describe synthesis 

25 ^paratuses capable of holding a plurality of reaction vessels for parallel synthesis of 
multiple discrete compoimds or for combinatorial libraries of compounds. 

In one embodiment, the combinatorial compound library can be synthesized 
in solution. The method disclosed in U.S. Patent No. 6,194,612 to Soger et al, which is 
hereby incorporated by reference in its entirety, features compounds useful as templates for 

30 solution phase synthesis of combinatorial libraries. The template is designed to permit 
reaction products to be easily purified fi"om unreacted reactants using liquid/liquid or 
solid/Uquid extractions. The compounds produced by combinatorial synthesis using the 
template will preferably be small organic molecules. Some compounds in the library may 
mimic the effects of non-peptides or peptides. In contrast to solid-phase synthesis of 

35 combinatorial compound libraries, liquid-phase synthesis does not require the use of 
specialized protocols for monitoring the individual steps of a multistep solid-phase 
synthesis (Egner et aL, 1995, J.Org. Chem. 60:2652; Anderson et al, 1995, J, Org. Chem. 
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60:2650; Fitch et al, 1994, J. Org. Chem. 59:7955; Look et al, 1994, J. Org. Chem. 
49:7588; Metzger et al, 1993, Aagew. Chem., lilt. Ed. Engl. 32:894; Youngquist et at, 
1994, Rapid Commun. Mass Spect 8:77; Chu etal, 1995, L Am. Chem. Soc. 117:5419; 
Bmmmel et al, 1994, Science 264:399; Stevanovic et al, 1993, Bioorg. Med. Chem. Lett. 
5 3:431). 

Combinatorial compound libraries usefiil for the methods of the present 
invention can be synthesized on solid supports. In one embodiment, a split synthesis 
method, a protocol of separating and mixing solid siq)ports during the synthesis, is used to 
synthesize a Ubrary of compounds on solid supports {see Lam et al^ 1997, Chem. Rev. 

10 97:41-448; Ohhneyer et al, 1993, Proc. Natl. Acad. Sci. USA 90:10922-10926 and 

references cited therein). Each solid support in the final library has substantially one type 
of compound attached to its surface. Other methods for synthesizing combinatorial libraries 
on solid supports, wherein one product is attached to each support, will be known to those 
of skill in the art (see, e.g., Nefzi et al., 1997, Chem. Rev. 97:449-472 and US Patent No. 

1 5 6,087, 1 86 to Cargill et al which are hereby incorporated by reference in their entirety). 

As used herein, the term "soUd support" is not limited to a specific type of 
sohd support. Rather a large number of supports are available and are known to one skilled 
in the art. Sohd supports that can be used in the assays of the invention include, for 
example, any surface to which compounds, either natively or via a linker, can be attached. 

20 Sohd supports include siUca gels, resins, derivatized plastic fihns, glass beads, glass slides 
(e.^., Hergenrother et al, 2000, J. Am. Chem, Soc. 122:7849-7850 and Kuruvilla et al, 
2002, Nature 416:653-657) and cotton, plastic beads, polystyrene beads, doped polystyrene 
beads (as described by Fenniri et al, 2000, J. Am. Chem. Soc. 123:8151-8152), polystyrene 
macrobeads (as described by Blackwell et al, 2001, Chemistry & Biology 8:1167-1 182), 

25 alumina gels, and polysaccharides. In a specific embodiment, the solid support is a glass 
shde. In a more specific embodiment, the soUd support is a glass microscope slide. 

A suitable solid support may be selected on the basis of desired end use and 
suitability for various synthetic protocols. For example, for peptide synthesis, a solid 
support can be a resin such as p-mefhylbenzhydrylamine (pMBHA) resin (Peptides 

30 International, Louisville, KY), polystyrenes (e.g., PAM-resin obtained firom Bachem Inc., 
Peninsula Laboratories, etc.), including chloromethylpolystyrene, 
hydroxymethylpolystyrene and aminomethylpolystyrene, poly (dimethylacrylamide)- 
grafled styrene co-divinyl-benzene (e.g., POLYHIPE resin, obtained firom Aminotech, 
Canada), polyamide resin (obtained fix)m Peninsula Laboratories), polystyrene resin grafted 

35 with polyethylene glycol (eg., TBNTAGEL or ARGOGEL, Bayer, Tubingen, Gemiany) 
polydimethylacrylamide resin (obtained firom MilligCT/Biosearch, California), or Sepharose 
(Pharmacia, Sweden). In another embodiment, the solid support can be a magnetic bead 
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coated with streptavidin, such as Dynabeads Streptavidin Pynal Biotech, Oslo, Norway). 

In one embodiinent, the solid phase support is suitable for in vivo use, it can 
serve as a carrier or support for adininistration of the compound to a patient {e.g., 
TENTAGEL, Bayer, Tubingen, Gennany). hi a particular embodiment, the solid support is 
palatable and/or orally ingestable. 

Any technique known to one of skill m the art can be used attach compounds 
to a solid support for use in the assays of the invention. In some embodiments of the 
present invention, compomids can be attached to soUd supports via linkers. Linkers can be 
integral and part of the soUd support, or they may be nonintegral that are either synthesized 
on the soUd support or attached thereto after synthesis. Linkers are useful not only for 
providing points of compound attachment to the soUd support, but also for allowing 
different groups of molecules to be cleaved from the sohd support under different 
conditions, depending on the nature of the linker. For example, linkers caa be, inter alia, 
electrophilically cleaved, nucleophihcally cleaved, photocleavable, enzymatically cleaved, 
cleaved by metals, cleaved under reductive conditions or cleaved under oxidative 
conditions. 

In some embodiments of the present invention, each compound contains a 
common functional group that mediates covalent attachment to a solid support. In a 
specific embodiment of the invention, the functional group that mediates covalent 
attachment to a solid support varies between the compounds. Compounds can be attached 
on a sohd support in any orientation and distribution that is suitable for the assays of the 
invention. In a further embodiment, compounds are attached or spotted on a solid support 
such as, e.g., a glass slide, with high spatial density and uniform distance between each spot 
so that an array is formed. Each surface is subsequently probed with a compound of 
interest. 

In one embodiment, compounds are appUed directly to a surface, such as, 
e.g. , a glass slide, using a manual transfer technique. In a particular embodiment, the 
compounds are transferred or spotted on a surface from a microtiter plate using a robotic 
arrayer. In another embodiment, the compounds are attached to beads that are subsequently 
transferred to wells in a microtiter plate where the compounds are released before being 
arrayed on a surface using any of the means described above. Any type and size of bead 
can be used to attach compounds of the invention. One skilled in the art would be familiar 
with the bead properties necessary for a specific purpose. In a particular embodiment, the 
bead material is polystyrene. 

In another embodiment, the combinatorial compound libraries can be 
assembled in situ using dynamic combinatorial chemistry as described in European Patent 
Application 1,1 18,359 Al to Lehn; Hue & Nguyen, 2001, Comb. Chem. High Throughput. 
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Screen. 4:53-74; Lehn and Eliseev, 2001, Science 291:2331-2332; Cousins et al 2000, 
Cuir. Opin. Chem. Biol. 4: 270-279; and Karan & Miller, 2000, Drug. Disc. Today 5:67-75 
which are incorporated by reference in their entirety. 

Dynamic combinatorial chemistry uses non-covalent interaction with a target 
5 biomolecule, including but not limited to a protein, RNA, or DNA, to favor assembly of the 
most tightly binding molecule that is a combination of constituent subunits present as a 
mixture in the presence of the biomolecule. According to the laws of thermodynamics, 
when a collection of molecules is able to combine and recombine at equilibrium through 
reversible chemical reactions in solution, molecules, preferably one molecule, that bind 

10 most tightly to a templating biomolecule will be present in greater amount than all other 
possible combinations. The reversible chemical reactions include, but are not limited to, 
imine, acyl-hydrazone, amide, acetal, or ester formation between carbonyl-containing 
compounds and amines, hydrazines, or alcohols; thiol exchange between disulfides; alcohol 
exchange in borate esters; Diels-Alder reactions; thermal- or photoinduced sigmatropic or 

15 electrocycUc rearrangements; or Michael reactions. 

In the preferred embodiment of this technique, the constituent components of 
the dynamic combinatorial compound library are allowed to combine and reach equilibrium 
in the absence of the target RNA and then incubated in the presence of the target RNA, 
preferably at physiological conditions, imtil a second equihbrium is reached. The second, 

20 perturbed, equilibrium (the so-called "templated mixture") can, but need not necessarily, be 
fixed by a fiirther chemical transformation, including but not limited to reduction, 
oxidation, hydrolysis, acidification, or basification, to prevent restoration of the original 
equilibrium when the dynamical combinatorial compound Ubrary is separated fi:om the 
target RNA. 

25 In the preferred embodiment of this technique, the predominant product or 

products of the templated dynamic combinatorial library can separated from the minor 
products and directly identified. In another embodiment, the identity of the predominant 
product or products can be identified by a deconvolution strategy involving preparation of 
derivative dynamic combinatorial libraries, as described in European Patent Application 

30 1,1 18,359 Al, which is incorporated by reference in its entirety, whereby each component 
of the mixture is, preferably one-by-one but possibly group-wise, left out of the mixture and 
the abiUty of the derivative Hbrary mixture at chemical equihbrium to bind the target RNA 
is measured. The components whose removal most greatly reduces the ability of the 
derivative dynamic combinatorial library to bind the target RNA are likely the components 

35 of the predominant product or products in the original dynamic combinatorial library. 

5.5 Library Screening 
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After a target nucleic acid, such as but not limited to RNA or DNA, is 
labeled and a compound library is synthesized or purchased or both, the labeled target 
nucleic acid is used to screen the hbrary to identify compounds that bind to the nucleic acid. 
Screening comprises contacting a labeled target nucleic acid with an individual, or small 
5 group, of the compounds of the compound library. Preferably, the contacting occurs in an 
aqueous solution, and most preferably, under physiologic conditions. The aqueous solution 
preferably stabilizes the labeled target nucleic acid and prevents denaturation or degradation 
of the nucleic acid without interfering with binding of the compounds. The aqueous 
solution can be similar to the solution in which a complex between the target RNA and its 

10 corresponding host cell factor (if known) is formed in vitro. For example, TK buffer, which 
is commonly used to form Tat protein-TAR RNA complexes in vitro, can be used in the 
methods of the invention as an aqueous solution to screen a Hbrary of compounds for RNA 
biuding compoimds. 

Alternatively, compounds are labeled and target RNA molecules are used to 

15 screen the library of labeled compounds. After compounds are labeled, target nucleic acids 
are used to screen the library of labeled compounds to identify those nucleic acids that bind 
to the labeled compounds. Screening comprises contacting a target nucleic acid with an 
individual, or small group, of the compounds of the labeled compound library. Preferably, 
the contacting occurs in an aqueous solution, and most preferably under physiologic 

20 conditions. The aqueous solution preferably stabilizes the target nucleic acid and prevents 
denaturation or degradation of the nucleic acid without interfering with binding of the 
compounds. The aqueous solution can be similar to the solution in which a complex 
between the target RNA and its corresponding host cell factor (if known) is formed in vitro. 
For example, TK buffer, which is conunonly used to form Tat protein-TAR RNA 

25 complexes in vitro, can be used in the methods of the invention as an aqueous solution to 
screen a library of compounds for RNA binding compounds. 

The methods of the present invention for screening a hbrary of compounds 
preferably comprise contacting a compound with a target nucleic acid in the presence of an 
aqueous solution, the aqueous solution comprising a buffer and a combination of salts, 

30 preferably approximating or mimicking physiologic conditions. The aqueous solution 
optionally further comprises non-specific nucleic acids, such as, but not limited to, DNA; 
yeast tRNA; salmon sperm DNA; homoribopolymers such as, but not limited to, poly IC, 
polyA, polyU, and polyC; and non-specific RNA. The non-specific RNA may be an 
unlabeled target nucleic acid having a mutation at the binding site, which renders the 
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unlabeled nucleic acid incapable of interacting with a compound at that site. For example, 
if dye-labeled TAR RNA is used to screen a library, unlabeled TAR RNA having a 
mutation in the uracil 23/cytosine 24 bulge region may also be present in the aqueous 
solution. Without being boimd by any theory, the addition of unlabeled RNA that is 
5 essentially identical to the dye-labeled target RNA except for a mutation at the binding site 
might minimize interactions of other regions of the dye-labeled target RNA with 
compounds or with the solid support and prevent false positive results. 

The solution further comprises a buffer, a combination of salts, and 
optionally, a detergent or a surfactant. The pH of the aqueous solution typically ranges 

10 fix)m about 5 to about 8, preferably from about 6 to about 8, most preferably from about 6.5 
to about 8. A variety of buffers may be used to achieve the desired pH. Suitable buffers 
include, but are not limited to, Tris, Mes, Bis-Tris, Ada, Aces, Pipes, Mopso, Bis-Tris 
propane, Bes, Mops, Tes, Hepes, Dipso, Mobs, Tapso, Trizma, Heppso, Popso, TEA, Epps, 
Tricine, Gly-Gly, Bicine, and sodium-potassium phosphate. The buffering agent comprises 

15 from about 10 mM to about 100 mM, preferably from about 25 mM to about 75 mM, most 
preferably from about 40 mM to about 60 mM buffering agent. The pH of the aqeuous 
solution can be optimized for different screening reactions, depending on the target RNA 
used and the types of compounds in the libraiy, and thorefore, the type and amount of the 
buffer used in the solution can vary from screen to screen. In a preferred embodiment, the 

20 aqueous solution has a pH of about 7.4, which can be achieved using about 50 mM Tris 
buffer. 

In addition to an appropriate buffer, the aqueous solution frirther comprises a 
combination of salts, from about 0 mM to about 100 mM KCl, from about 0 mM to about 1 
M NaCl, and from about 0 mM to about 200 mM MgCla. In a preferred embodiment, the 

25 combination of salts is about 100 mM KCl, 500 mM NaCl, and 10 mM MgCb. Without 
being bound by any theory. Applicant has found that a combination of KCl, NaCl, and 
MgCla stabilizes the target RNA such that most of the RNA is not denatured or digested 
over the course of the screening reaction. The optional concentration of each salt used in 
the aqueous solution is dependent on the particular target RNA used and can be determined 

30 using routine experimentation. 

The solution optionally comprises from about 0.01% to about 0.5% (w/v) of 
a detergent or a surfactant. Without being bound by any theory, a small amount of 
detergent or surfactant in the solution might reduce non-specific binding of the target RNA 
to the solid support and control aggregation and increase stability of target RNA molecules. 

47 



wo 2004/010106 PCT/US2003/023075 
Typical detergents useful in the methods of the present invention include, but are not 
limited to, anionic detergents, such as salts of deoxycholic acid, l-heptanesulfonic acid, N- 
lauiylsarcosine, lauryl sulfate, 1-octane sulfonic acid and taurocholic acid; cationic 
detergents such as benzaJkonium chloride, cetylpyridinium, methylbenzefhonium chloride, 
and decamethonium bromide; zwitteiionic detergents such as C3HAPS, CHAPSO, alkyl 
betaines, alkyl amidoalkyl betaines, N-dodecyl-N,N-dimethyl-3-anmiomo-l- 
propanesulfonate, and phosphatidylcholine; and non-ionic detergents such as n-decyl a-D- 
ghicopyranoside, n-decyl fi-D-maltopyranoside, n-dodecyl B-D-maltoside, n-octyl B-D- 
glucopyranoside, sorbitan esters, n-tetradecyl B-D-maltoside, octylphenoxy 
polyethoxyethanol (Nonidet P-40), nonylphenoxypolyethoxyethanol (NP-40), and tritons. 
Preferably, the detergent, if present, is a nonionic detergent. Typical surfactants useful in 
the methods of the present invention include, but are not limited to, ammonium lauryl 
sulfate, polyethylene glycols, butyl glucoside, decyl glucoside, Polysorbate 80, lauric acid, 
myristic acid, pahnitic acid, potassium palmitate, undecanoic acid, lauryl betaine, and lauryl 
alcohol. More preferably, the detergent, if preseait, is Triton X-lOO and present in an 
amount of about 0.1% (w/v). 

Non-specific binding of a labeled target nucleic acid to compounds can be 
further minimized by treating the bmding reaction with one or more blocking agents. Non- 
specific binding of a unlabeled target nucldc acid to labeled compounds can be further 
nuniniized by treating the binding reaction with one or more blocking agents. In one 
embodiment, the binding reactions are treated with a blockmg agent, e.g., bovine serum 
albumin CTBSA'O, before contactmg with to the labeled target nucleic acid. In another 
embodiment, the binding reactions are treated sequentially with at least two different 
blockmg agents. This blockmg step is preferably performed at room temperature for from 
about 0.5 to about 3 hours. In a subsequent step, the reaction mixture is further treated with 
unlabeled RNA having a mutation at the binding site. This blocking step is preferably 
performed at about 4^C for &om about 12 hours to about 36 hours before addition of the 
dye-labeled target RNA. Preferably, the solution used in the one or more blocking steps is 
substantially similar to the aqueous solution used to screen the Ubrary with the dye-labeled 
target RNA, e.g., in pH and salt concentration. 

Once contacted, the mixture of labeled target nucleic acid and the compound 
is preferably mamtained at 4^C for from about 1 day to about 5 days, preferably from about 
2 days to about 3 days with constant agitation. To identify the reactions m which bmdmg to 
the labeled target nucleic acid occurred, after the incubation period, bound from free 
compounds are determined using any of the metiiods disclosed in Section 5.5 infra. In a 
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Specific embodiment, the complexed target nucleic acid does not need to be separated from 
the free target nucleic acid if a technique (ie,, spectrometry) that diferentiates between 
bound and unbound target nucleic acids is used. 

In another raibodiment, once contacted, the mixture of target nucleic acid 
5 and the labeled compound is preferably maintained at 4°C for from about 1 day to about 5 
days, preferably from about 2 days to about 3 days with constant agitation. To identify the 
reactions in which binding to the target nucleic acid occurred, after the incubation period, 
bound from free compounds are determined using any of the methods disclosed in Section 
5.5 infra. In a specific embodiment, the complexed target nucleic acid does not need to be 

10 separated from the free target nucleic acid if a technique (i.e., spectrometry) that 
diferentiates between bound and unbound target nucleic acids is used. 

The methods for identifying small molecules bound to labeled nucleic acid 
will vary with the type of label on the target nucleic acid. For example, if a target RNA is 
labeled with a visible of fluorescent dye, the target RNA complexes are preferably 

15 identified using a chromatographic technique that separates bound from free target by an 
electrophoretic or size differential technique using individual reactions. The reactions 
corresponding to changes in the migration of the complexed RNA can be cross-referenced 
to the small molecule compound(s) added to said reaction. Alternatively, complexed target 
RNA can be screened en masse and then separated from free target RNA using an 

20 electrophoretic or size differential technique, the resultant complexed target is then 

analyzed using a mass spectrometric technique. In this fashion the bound small molecule 
can be identified on the basis of its molecular weight. In this reaction a priori knowledge of 
the exact molecular weights of all compounds within the library is known. In another 
embodiment, the compounds boxmd to flie target nucleic acid may not require separation 

25 from the unbound target nucleic acid if a technique such as, but not limited to, spectrometry 
is used. 

The methods for identifying labeled small molecules bound to unlabeled 
nucleic acid will vary with the type of label on the compound. For example, if compound is 
labeled with a visible of fluorescent dye, the target RNA complexes are preferably 

30 identified tising a chromatographic technique that separates bound from free target by an 
electrophoretic or size differential technique using individual reactions. The reactions 
corresponding to changes in the migration of the complexed RNA can be cross-referenced 
to the small molecule compound(s) added to said reaction. Alternatively, complexed target 
RNA can be screened en masse and then separated from free target RNA using an 

35 electrophoretic or size differential technique, the resultant complexed target is then 

analyzed using a mass spectrometric technique. In this fashion the bound small molecule 
can be identified on the basis of its molecular weight In this reaction a priori knowledge of 
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the exact molecular weights of all compounds within the library is known. In another 
embodiment, the compounds bound to the target nucleic acid may not require separation 
from the unbound target nucleic acid if a technique such as, but not limited to, spectrometry 
is used. 

5 

5.6 Separation Methods for Screening Compounds 
Any method that detects an altered physical property of a target nucleic acid 
complexed to a compound from the unbound target nucleic acid may be used for separation 
of the complexed and non-complexed target nucleic acids. Methods that can be utilized for 

10 the physical separation of complexed target RNA from unbound target RNA include, but 
are not limited to, electrophoresis, fluorescence spectroscopy, surface plasmon resonance, 
mass spectrometry, scintillation, proximity assay, structure-activity relationships ("SAR") 
by NMR spectroscopy, size exclusion chromatogr^hy, affinity chromatography, 
nanoparticle aggregation^ flow cytometry, manual batch, and suspension of beads in electric 

15 fields. 

In embodiments that use solid support based methods, after the labeled target 
RNA is contacted with the library of compounds immobilized on a solid support 
beads) or the target RNA conjugated or attached to the solid support is contacted with the 
library of detectably labeled compounds, the solid support (e.g., beads) must then be 

20 separated from the unbound target RNA or unbound compounds, respectively, in the liquid 
phase. This can be accomplished by any number of physical means; eg., sedimentation, 
centrifiigation. Thereafter, a number of methods can be used to separate the solid support- 
based hbrary that is complexed with the labeled target RNA from uncomplexed beads in 
order to isolate the compound on the bead. Alternatively, mass spectroscopy and NMR 

25 spectroscopy can be used to simultaneously identify and separate beads complexed to the 
labeled target RNA from uncomplexed beads. 



5.6.1 Electrophoresis 
Methods for separation of the complex of a target RNA bound to a 
30 compound from the unbound RNA comprises any method of electrophoretic separation, 
including but not limited to, denaturing and non-denaturing polyacrylamide gel 
electrophoresis, urea gel electrophoresis, gel filtration, pulsed field gel electrophoresis, two 
dimensional gel electrophoresis, continuous flow electrophoresis, zone electrophoresis, 
agarose gel electrophoresis, and capillary electrophoresis. 
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In a preferred embodiment, an automated electrophoretic system comprising 
a capillary cartridge having a plurality of capillary tubes is used for high-throughput 
screening of compounds bound to target RNA. Such an apparatus for performing 
automated capillary gel electrophoresis is disclosed in U.S. Patent Nos. 5,885,430; 
5 5,916,428; 6,027,627; and 6,063,251, the disclosures of which are incorporated by reference 
in their entireties. 

The device disclosed in U.S. Patent No. 5,885,430, which is incorporated by 
reference in its entirety, allows one to simultaneously introduce samples into a plurality of 
capillary tubes directly from microtiter trays having a standard size. U.S. Patent No. 

10 5,885,430 discloses a disposable capillary cartridge which can be cleaned between 

electrophoresis runs, the cartridge having a plurality of capillary tubes. A first end of each 
capillary tube is retained in a mounting plate, the first ends collectively forming an array in 
the mounting plate. The spacing between the first ends corresponds to the spacing between 
the centers of the wells of a microtiter tray having a standard size. Thus, the first ends of 

15 the capillary tubes can simultaneously be dipped into the samples present in the tray's 

wells. The cartridge is provided with a second mounting plate in which the second ends of 
the capillary tubes are retained. The second ends of the capillary tubes are arranged in an 
array which corresponds to the wells in the microtiter tray, which allows for each capillary 
tube to be isolated from its neighbors and therefore free from cross-contamination, as each 

20 end is dipped into an individual well. 

Plate holes may be provided in each mounting plate and the capillary tubes 
inserted through these plate holes. In such a case, the plate holes are sealed airtight so that 
the side of the mounting plate having the exposed capillary ends can be pressurized. 
Application of a positive pressure in the vicinity of the capillary openings in this mounting 

25 plate allows for the introduction of air and fluids during electrophoretic operations and also 
can be used to force out gel and other materials from the capillary tubes during 
reconditioning. The capillary tubes may be protected from damage using a needle 
comprising a cannula and/or plastic tubes, and the like when they are placed in these plate 
holes. When metallic cannula or the like are used, they can serve as electrical contacts for 

30 current flow during electrophoresis. In the presence of a second mounting plate, the second 
mounting plate is provided with plate holes through which the second ends of the capillary 
tubes project. In this instance, the second mounting plate serves as a pressure contaitmient 
member of a pressure cell and the second ends of the capillary tubes communicate with an 
internal cavity of the pressure cell. The pressure cell is also formed with an inlet and an 
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outlet. Gels, buffer solutions, cleaning agents, and the like may be introduced into the 
internal cavity through the inlet, and each of these can simultaneously enter the second ends 
of the capillaries. 

In another preferred embodiment, the automated electrophoretic system can 
5 comprise a chip system consisting of complex designs of interconnected channels that 

perfomi and analyze enzyme reactions using part of a channel design as a tiny, continuously 
operating electrophoresis material, where reactions with one sample are going on in one 
area of the chip while electrophoretic separation of the products of another sample is taking 
place in a different part of the chip. Such a system is disclosed in U.S. Patent Nos. 
10 5,699,157; 5,842,787; 5,869,004; 5,876,675; 5,942,443; 5,948,227; 6,042,709; 6,042,710; 
6,046,056; 6,048,498; 6,086,740; 6,132,685; 6,150,119; 6,150,180; 6,153,073; 6,167,910; 
6,171,850; and 6,186,660, the disclosures of which are incorporated by reference in their 
entireties. 

The system disclosed in U.S. Patent No. 5,699,157, which is hereby 

15 incorporated by reference in its entirety, provides for a microfluidic system for high-speed 
electrophoretic analysis of subject materials for appUcations in the fields of chemistry, 
biochemistry, biotechnology, molecular biology and numerous other areas. The system has 
a channel in a substrate, a light source and a photoreceptor. The channel holds subject 
materials in solution in an electric field so that the materials move throu^ the channel and 

20 separate into bands according to species. The light source excites fluorescent light in the 
species bands and the photoreceptor is arranged to receive the fluorescent light from the 
bands. The system further has a means for masking the channel so that the photoreceptor 
can receive the fluorescent light only at periodically spaced regions along the channel. The 
system also has an unit coimected to analyze the modulation firequencies of Ught intensity 

25 received by the photoreceptor so that velocities of the bands along the channel are 
determined, which allows the materials to be analyzed. 

The system disclosed in U.S. Patent No. 5,699,157 also provides for a 
method of performing high-speed electrophoretic analysis of subject materials, which 
comprises the steps of holding the subject materials in solution in a channel of a 

30 microfluidic system; subjecting the materials to an electric field so that the subject materials 
move through the channel and separate into species bands; directing Kght toward the 
channel; receiving light firom periodically spaced regions along the channel simultaneously; 
and analyzing the frequencies of Ught intensity of the received ligjit so that velocities of the 
bands along the chaimel can be determined for analysis of said materials. The 
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detennination of the velocity of a species band deteimines the electrophoretic mobiUty of 
the species and its identification. 

U.S. Patent No. 5,842,787, which is hereby incorporated by reference in its 
entirety, is generally directed to devices and systems employ channels having, at least in 
5 part, depths that are varied over those which have been previously described (such as the 
device disclosed in U.S. Patent No. 5,699,157), wherem said channel depths provide 
numerous beneficial and unexpected results such as but not limited to, a reduction in sample 
perturbation, reduced non-specific sample nuxture by diffusion, and increased resolution. 

In another embodiment, the electrophoretic method of separation comprises 
10 polyacrylamide gel electrophoresis. In a preferred embodiment, the polyacrylamide gel 
electrophoresis is non-denaturing, so as to differentiate the mobilities of the target RNA 
bound to a compound firom free target RNA. If the polyacrylamide gel electrophoresis is 
denaturing, then the target RNAxompound complex must be cross-linked prior to 
electrophoresis to prevent the disassociation of the target RNA from the compound during 
15 electrophoresis. Such techniques are well known to one of skill in the art. 

In one embodiment of the method, the binding of compounds to target 
nucleic acid can be detected, preferably in an automated fashion, by gel electrophoretic 
analysis of interference footprinting. RNA can be degraded at specific base sites by 
enzymatic methods such as ribonucleases A, U2, CL3, Ti, Phy M, and B. cereus or chemical 
20 methods such as diefhylpyrocarbonate, sodium hydroxide, hydrazine, piperidine formate, 
dimethyl sulfate, [2,12-dimethyl-3,7,ll,17-tetraazacyclo[11.3.1]heptadeca- 
l(17),2,ll,13,15-pentaenato] nickel(II) (NiCR), cobalt(II)chloride, or iron(ir) 
ethylenediaminetetraacetate (Fe-EDTA) as described for example in Zheng et aL, 1999, 
Biochem. 37:2207-2214; Latham & Cech, 1989, Science 2A5:216'2%2\ and Sambrook et 
IS aU 2001, in Molecular Cloning: A Laboratory Manual, pp 12.61-12.73, Cold Spring Harbor 
Laboratory Press, and the references cited therein, which are hereby incorporated by 
reference in their entireties. 

The specific pattern of cleavage sites is determined by the accessibility of particular 
bases to the reagent employed to initiate cleavage and, as such, is therefore is determined by 
30 the three-dimensional structure of the RNA. The interaction of small molecules with a 
target nucleic acid can change the accessibiUty of bases to these cleavage reagents both by 
causing conformational changes in the target nucleic acid or by covering a base at the 
binding interface. When a compound binds to the nucleic add and changes the accessibility 
of bases to cleavage reagents, the observed cleavage pattem will change. This method can 
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be used to identify and characterize the binding of small molecules to KNTA as described, 
for example, by Prudent et al, 1995, J. Am. Chem. See. 1 17:10145-10146 and Mei et ai, 
1998, Biochem. 37:14204-14212. 

In the preferred embodiment of this technique, the detectably labeled target 
5 nucleic acid is incubated with an individual compound and then subjected to treatment with 
a cleavage reagent, either enzymatic or chemical. The reaction mixture can be preferably 
be examined directly, or treated further to isolate and concentrate the nucleic acid. The 
fragments produced are separated by electrophoresis and the pattem of cleavage can be 
compared to a cleavage reaction performed in the absence of compound. A change in the 
10 cleavage pattem directly indicates that the compound binds to the target nucleic acid. 
Multiple compounds can be exanained both in parallel and serially. 

Other embodiments of electrophoretic separation include, but are not Mmited 
to urea gel electrophoresis, gel filtration, pulsed field gel electrophoresis, two dimensional 
gel electrophoresis, continuous flow electrophoresis, zone electrophoresis, and agarose gel 
15 electrophoresis. 

5.6.2 Flaorescence Spectroscopy 
In a preferred embodiment, fluorescence polarization spectroscopy, an 

optical detection method that can differentiate the proportion of a fluorescent molecule that 

is either bound or unbound in solution (e.g., the labeled target nucleic acid of the present 
20 invention), can be used to read reaction results without electrophoretic separation of the 

samples. Fluorescence polarization spectroscopy can be used to read the reaction results in 

the chip system disclosed in U.S. Patent Nos. 5,699,157; 5,842,787; 5,869,004; 5,876,675; 

5,942,443; 5,948,227; 6,042,709; 6,042,710; 6,046,056; 6,048,498; 6,086,740; 6,132,685; 

6,150,119; 6,150,180; 6,153,073; 6,167,910; 6,171,850; and 6,186,660, the disclosures of 
25 which are incorporated by reference in their entireties. The application of fluorescence 

polarization spectroscopy to the chip system disclosed in the U.S. Patents hsted supra is 

fast, efficient, and well-ad^ted for high-throughput screening. 

Tn another embodiment, a compound that has an affinity for the target 

nucleic acid of interest can be labeled with a fluorophore to screen for compounds that bind 
30 to the target nucleic acid. For example, a pyrene-containing aminoglycoside analog was 

used to accurately monitor antagonist binding to a prokaryotic 1 68 rRNA A site (which 

comprises the natural target for aminoglycoside antibiotics) in a screen using a fluorescence 

quenching technique in a 96-well plate format (Hamas2dd & Rando, 1998, Anal. Biochem. 

261(2):183-90). 
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In another embodiment, fluorescence resonance energy transfer (FRET) can 
be used to screen for compounds that bind to the target nucleic acid. FRET, a characteristic 
change in fluorescence, occurs when two fluorophores with overlapping emission and 
excitation wavelength bands are held together in close proxinaity, such as by a binding 
5 event. In the preferred embodiment, the fluorophore on the target nucleic acid and the 
fluorophore on the compounds will have overlapping excitation and emission spectra such 
that one fluorophore (the donor) transfers its emission energy to excite the other fluorophore 
(the acceptor). The acceptor preferably emits light of a different wavelength upon relaxing 
to the ground state, or relaxes non-radiatively to quench fluorescence. FRET is very 

10 sensitive to the distance between the two fluorophores, and allows measurement of 

molecular distances less than 10 mn. For example, U.S, Patent 6,337,183 to Arenas et al^ 
which is incorporated by reference in its entirety, describes a screen for compounds that 
bind RNA that uses FRET to measure the effect of compoimds on the stability of a target 
RNA molecule where the target RNA is labeled with both fluorescent acceptor and donor 

15 molecules and the distance between the two fluorophores as determined by FRET provides 
a measure of the folded structure of the RNA. Matsumoto et aL (2000, Bioorg. MeA Chem. 
Lett. 10:1857-1861) describe a system where a peptide that binds to HTV-l TAR RNA is 
labeled on one end with a fluorescein fluorophore and a tetramethylrhodamine on the other 
end. The conformational change of the peptide upon binding to the RNA provided a FRET 

20 signal to screen for compounds that bound to the TAR RNA. 

In the preferred embodiment, both the target nucleic acid and a compound 
that has an affinity for the target nucleic add of interest are labeled with fluorophores with 
overlapping emission and excitation q)ectra (donor and acceptor), including but not limited 
to fluorescein and derivatives, rhodamine and dmvatives, cyanine dyes and derivatives, 

25 bora-3a,4a-diaza-s-indacene (BODIPY®) and derivatives, pyrene, nanoparticles, or 

non-fluorescent quenching molecules. Binding of a labeled compound to the target nucleic 
acid can be identified by the change in observable fluorescence as a result of FRET. 

If the target nucleic acid is labeled with the donor fluorophore, then the 
compounds are labeled with the acceptor fluorophore. Conversely, if the target nucleic acid 

30 is labeled with the acceptor fluorophore, then the compounds are labeled with the donor 
fluorophore. A wide variety of labels may be used, with the choice of label depending on 
sensitivity required, ease of conjugation with the compound, stability requirements, 
available mstrumentation, and disposal provisions. The fluorophore on the target nucleic 
acid must be in close proximity to the binding site of the compounds, but should not be 
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incorporated into a target nucleic acid at the specific binding site at which compounds are 
likely to bind, since the presence of a covalently attached label might interfere sterically or 
chemically with the binding of the compounds at this site. 

In yet another embodiment, homogeneous time-resolved fluorescence 
5 CUTRF") techniques based on time-resolved energy transfer from lanthanide ion 

complexes to a suitable acceptor species can be ad£^ted for high-throughput screening for 
inhibitors of KNArprotein complexes (Hemmiia, 1999, J. BiomoL Screening 4:303-307; 
Mathis, 1999, J. BiomoL Screening 4:309-313). HTRF is similar to fluorescence resonance 
energy transfer usmg conventional organic dye pairs, but has several advantages, such as 

10 increased sensitivity and efi&ciency, and background elimination (Xavier et aL, 2000, 
Trends Biotechnol. 18(8):349-356). 

It is also contemplated that the target RNA may be labeled with a 
fluorophore and the compounds in the library labeled with a quencher of the fluorophore, or 
alternatively, the target RNA labeled with a quencher of the fluorophore and the compounds 

15 in the library labeled with a fluorophore, so that when a compound and target RNA bind, 
the fluorescent signal of the fluorophore is quenched. 

Fluorescrace spectroscopy has traditionally been used to characterize DNA- 
protein and protein-protein interactions, but fluorescence spectroscopy has not been widely 
used to characterize RNA-protein interactions because of an interfering absorption of SNA 

20 nucleotides with the intrinsic tryptophan fluorescence of proteins pCavier et al , 2000, 

Trends Biotechnol. 18(8):349-356.). However, fluorescence spectroscopy has been used in 
studying the single tryptophan residue within the arginine-rich RNA-binding domain of Rev 
protein and its intemction with the RRE in a time-resolved fluorescence study (Kwon & 
Carson, 1998, Anal. Biochem. 264:133-140). Thus, in this invention, fluorescence 

25 spectroscopy is less preferred if the compounds or peptides or proteins possess intrinsic 
tryptophan fluorescence. However, fluorescence spectroscopy can be used for compounds 
that do not possess intriasic fluorescence. 

5.6.3 Surface Plasmon Resonance f"SPR") 

Surface plasmon resonance (SPR) can be used for determining kinetic rate 
30 constants and equilibrium constants for macromolecular interactions by followiag the 
association project in "real time" (Schuck, 1997, Annu. Rev. Biophys. Biomol. Struct. 
26:541-566). 

The principle of SPR is summarized by Xavier et al (Trends Biotechnol., 
2000, 18(8):349-356) as follows. Total intemal reflection occurs at the boundary between 
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two substances of different refractive index. The incident light's electromagnetic field 
penetrates beyond the interface as an evanescent wave, which extends a few hundred 
nanometers beyond the surface into the medium. Insertion of a thin gold foil at the interace 
produced SPR owing to the absorption of the energy from the evanescent wave by free 
5 electron clouds of the metal (plasmons). As a result of this absorbance, there is a drop in 
the intensity of the reflected light at a particular angle of incidence. The evanescent wave 
profile depends exquisitely on the refiractive index of the medium it probes. Thus, the angle 
at which absorption occurs is very sensitive to the refiractive changes in the external 
medium. All proteins and nucleic acids are known to change the refractive index of water 

10 by a similar amount per unit mass, irrespective of their amino acid or nucleotide 

composition (the refractive index change is different for proteins and nucleic acids). When 
the protein or nucleic acid content of the layer at the sensor changes, the refractive index 
also changes. Typically, one member of a complex is immobihzed in a dextran layer and 
then the other member is introduced into the solution, either in a flow cell (Biacore AB, 

15 Uppsala, Sweden) or a stirred cuvette (Affinity Sensors, Santa Fe, New Mexico). It has 
been determined that there is a linear correlation between the surface concentration of 
protein or nucleic acid and the shift in resonance angle, which can be used to quantitate 
kinetic rate constants and/or the equilibriimi constants. 

In the present invention, the target RNA may be immobilized to the sensor 

20 surface through a streptavidin-biotin linkage, the linkage of which is disclosed by Crouch et 
al (Methods Mol, Biol., 1999, 1 18: 143-160). The RNA is biotinylated either during 
synthesis or post-synthetically via the conversion of the 3' terminal ribonucleoside of the 
RNA into a reactive free amino group or using a T7 polymerase incorporated guanosine 
monophosphorothioate at the 5' end. SPR has been used to determine the stoichiometry 

25 and affinity of the interaction between the HIV Rev protein and the RRE (Van Ryk & 
Venkatesan, 1999, J. Biol. Chem. 274:17452-17463) and the aminoglycoside antibiotics 
with RRE and a model RNA derived from the 16S ribosomal A site, respectively (Hendrix 
et al, 1997, J. Am. Chem. Soc. 119:3641-3648; Wong et al, 1998, Chem. Biol. 5:397-406). 

In one embodiment of the present invention, the target nucleic acid can be 

30 immobilized to a sensor surface (e.^., by a streptavidin-biotin linkage) and SPR can be used 
to (a) determine whether the target RNA binds a compound and (b) further characterize the 
binding of the target nucleic adds of the present invention to a compound. 

5.6.4 Mass Spectrometry 
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An automated method for analyzing mass spectrometer data which can 
analyze complex mixtures containing many thousands of components and can correct for 
background noise, multiply charged peaks and atomic isotope peaks is described in U.S. 
Patent No. 6, 147,344, which is hereby incorporated by reference in its entirety. The system 
5 disclosed in U.S. Patent No. 6,147,344 is a method for analyzing mass spectrometer data in 
which a control sample measurement is performed providing a background noise check. 
The peak height and width values at each m/z ratio as a function of time are stored in a 
memory. A mass spectrometer operation on a material to be analyzed is performed and the 
peak height and width values at each m/z ratio versus tune are stored in a second memory 

10 location. The mass spectrometer operation on the material to be analyzed is repeated a 
fixed number of times and the stored control sample values at each m/z ratio level at each 
time increment are subtracted from each corresponding one from the operational runs, thus 
producing a difference value at each mass ratio for each of the multiple runs at each time 
increment. If the MS value minus the background noise does not exceed a preset value, the 

15 m/z ratio data point is not recorded, thus eliminating backgroimd noise, chemical noise and 
false positive peaks from the mass spectrometer data. The stored data for each of the 
multiple runs is then compared to a predetermined value at each m/z ratio and the resultant 
series of peaks, which are now determined to be above the background, is stored in the m/z 
points in which the peaks are of significance. 

20 One possibility for the utilization of mass spectrometry in high throughput 

screening is the integration of SPR with mass spectrometry. Approaches that have been 
tried are direct analysis of the analyte retained on the sensor chip and mass spectrometry 
with the eluted analyte (Sonksen et al, 1998, Anal. Chem. 70:2731-2736; Nelson & Krone, 

1999, J. Mol. Recog. 12:77-93). Further developments, especially iq the interfacing of the 
25 sensor chip with the mass spectrometer and in reusing the sensor chip, are required to make 

SPR combined with mass spectroscopy a higji-throughput method for biomolecular 
interaction analysis and the screening of targets for small molecule inhibitors (Xavier et al, 

2000, Trends Biotechnol. 18(8):349-356). 

In one embodiment of the present invention, the target nucleic acid 
30 complexed to a compound can be determined by any of the mass spectrometry processed 
described supra. Furthermore, mass spectrometry can also be used to elucidate the structure 
of the compound. 

5.6.5 Scintillation Proximity Assay f^^SPA'^ 
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Scintillation proximity assay ("SPA") is a method that can be used for 
screening small molecules that bind to the target RNAs. SPA would involve radiolabeling 
either the target KNA or the compound and then quantitating its binding to the other 
member to a bead or a surface impregnated with a scintillaat (Cook, 1996, Drug Discov. 
5 Today 1 :287-294). Currently, fluorescence-based techniques are preferred for high- 
throughput screening (Pope et al, 1999, Drug Discov. Today 4:350-362). 

Screening for small molecules that inhibit Tat peptiderTAR RNA interaction 
has been performed with SPA, and inhibitors of the interaction were isolated and 
characterized (Mei etaL, 1997, Bioorg. Med. Chem. 5:1173-1184; Mei etcd., 1998, 
10 Biochemistry 37:14204-14212). A similar approach can be used to identify small 

molecules that directly bind to a preselected target RNA element in accordance with the 
invention can be utilized. 

SPA can be adapted to high throughput screening by the availabihty of 
microplates, wherein the scintillant is directly incorporated into the plastic of the microtiter 
15 wells (Nakayama et aL, 1998, J. Biomol. Screening 3:43-48). Thus, one embodiment of the 
present invention comprises (a) labeling of the target nucleic acid with a radioactive or 
fluorescent label; (b) contacted the labeled nucleic acid with compounds, wherein each 
compound is in a microtiter well coated with scintillant and is tethered to the microtiter 
weQ; and (c) identifying and quantifying the compounds bound to the target nucleic acid 
20 with SPA, wherein the compound is identified by virtue of its location in the microplate. 

5.6.6 Stracture>Activitv Relationships ("SAR'^) by NMR Spectroscopy 

NMR spectroscopy is a valuable technique for identifying complexed target 
nucleic acids by qualitatively determining changes in chemical shift, specifically from 
distances measured using relaxation effects, and NMR-based approaches have been used in 

25 the identification of small molecule binders of protein drug targets QCavier et al , 2000, 
Trends Biotechnol. 18(8):349-356). The determination of structure-activity relationships 
("SAR") by NMR is the first method for NMR described in which small molecules that 
bind adjacent subsites are identified by two-dimentional ^H-^^ spectra of the target protein 
(Shuker et al, 1996, Science 274:1531-1534). The signal from the bound molecule is 

30 monitored by employing line broadening, transferred NOEs and pulsed field gradient 
difiusion measurements (Moore, 1999, Curr. Opin. Biotechnol. 10:54-58). A strategy for 
lead generation by NMR using a library of small molecules has been recently described 
(Fejzo et al, 1999, Chem. Biol. 6:755-769). 
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In one embodiment of the present invention, the target nucleic acid 
complexed to a compound can be determined by S AR by NMR. Furthermore, SAR by 
NMR can also be used to elucidate the structure of the compoimd. 

5.6.7 Size Exclasion Chromatography 

5 hi another embodiment of the present invention^ size-exclusion 

chromatography is used to purify compounds that are bound to a target nucleic from a 
complex mixture of compounds. Size-exclusion chromatography separates molecules based 
on their size and uses gel-based media comprised of beads with specific size distributions. 
When applied to a column, this media settles into a tightly packed matrix and forms a 

10 complex array of pores. Separation is accomplished by the inclusion or exclusion of 
molecules by these pores based on molecular size. Small molecules are included into the 
pores and, consequently, their migration through the matrix is retarded due to tiie added 
distance they must travel before elution. Large molecules are excluded from the pores and 
migrate with the void volume when ^pUed to the matrix. Li the present invention, a target 

15 nucleic acid is incubated with a mixture of compounds while firee in solution and allowed to. 
reach equilibrium. When applied to a size exclusion column, compounds free in solution 
are retained by the column, and compounds boimd to the target nucleic acid are passed 
through the column. In a preferred embodiment, spin columns commonly used for 
"desalting" of nucleic acids will be employed to separate bound from unbound compoxmds 

20 (e.g., Bio-Spin columns manufactured by BIO-RAD). In another embodiment, the size 
exclusion matrix is packed into multiwell plates to allow high throughput separation of 
mixtures (e.g., PLASMID 96-well SEC plates manufactured by MiUipore). 

5.6.8 Affinity Chromatography 

hi one embodiment of the present invention, a£5nity capture is used to purify 
25 compounds that are boimd to a target nucleic acid labeled with an afGnity tag from a 

complex mixture of compounds. To accomplish this, a target nucleic acid labeled with an 
affinity tag is incubated with a mixture of compounds while free in solution and then 
captured to a solid support once equilibrium has been established; altematively, target 
nucleic acids labeled with an affinity tag can be captured to a soUd support first and then 
30 allowed to reach equilibriimi with a mixture of compounds. 

The solid support is typically comprised of, but not limited to, cross-linked 
agarose beads that are coupled with a ligand for the affinity tag. Altematively, the solid 
support may be a glass, siUcon, metal, or carbon, plastic (polystyrene, polypropylene) 
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sinrface with or without a self-assembled monolayer (SAM) either with a covalently 
attached ligand for the affinity tag, or with inherent affinity for the tag on the target nucleic 
acid. 

Once the complex between the target nucleic acid and compound has 
5 reached equilibriivn and has been captured, one skilled in the art will appreciate that the 
retention of bound compounds and removal of unbound compounds is facilitated by 
washing the solid support with large excesses of binding reaction buffer. Furthermore, 
retention of higih a£6baity con^oimds and removal of low affinity compounds can be 
accomplished by a number of means that increase the stringency of washing; fliese means 

10 include, but are not limited to, increasuig the number and duration of washes, raising the 
salt concentration of the wash buffer, addition of detergent or surfactant to the wash buffer, 
and addition of non-specific competitor to the wash buffer. 

Iq one embodiment, the compounds themselves are detectably labeled with 
fluorescent dyes, radioactive isotopes, or nanoparticles. When the compounds are applied 

15 to the captured target nucleic acid in a spatially addressed fashion (e,g, , in separate wells of 
a 96-weU microplate), binding between the compounds and the target nucleic acid can be 
determined by the presence of the detectable label on the compoimd using fluorescence. 

Following the removal of unbound compounds, bound compounds with high 
afSnity for the target nucleic acid can be eluted firom the immobilized target nucleic acids 

20 and analyzed. The elution of compounds can be accomplished by any means that break the 
non-covalent interactions between the target nucleic acid and compound. Means for elution 
include, but are not Umited to, changing the pH, changing the salt concentration, the 
q)plication of organic solvents, and the application of molecules that compete with the 
bound ligand. In a preferred embodiment, the means employed for elution will release the 

25 compound fix)m the target RNA, but will not effect the interaction between the affinity tag 
and the solid support, thereby achieving selective elution of compound. Moreover, a 
preferred embodiment will employ an elution buffer that is volatile to allow for subsequent 
concentration by lyophilization of the eluted compound (e.g., 0 M to 5 M ammonium 
acetate). 

30 In another embodiment of the invention, the target RNA can be labeled with 

biotin, an antigen, or a ligand. Library beads complexed to the target RNA can be separated 
from uncomplexed beads using affinity techniques designed to c^ture the labeled moiety 
on the target RNA. For example, a solid support, such as but not limited to, a colmnn or a 
well in a microwell plate coated with avidin/streptavidin, an antibody to the antigen, or a 
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receptor for the ligand can be used to capture or immobilize the labeled beads. Complexed 
RNA may or may not be irreversibly bound to the bead by a further transformation between 
the bound RNA and an additional moiety on the surface of the bead. Such linking methods 
include, but are not limited to: photochemical crosslinking between RNA and bead-bound 
5 molecules such as psoralen, thymidine or uridine derivates either present as monomers, 
oligomers, or as a partially complementary sequence; or chemical Hgation by disulfide 
exchange, nitrogen mustards, bond formation between an electrophile and a nucleophile, or 
alkylating reagents. See, e.g., Ihtemational Patent PubUcation WO/0146461, the contents 
of which are hereby incorporated by reference. The unbound library beads can be removed 

1 0 after the binding reaction by washing the solid phase. If the RNA is irreversibly bound to 
the bead, compounds can be isolated from the bead following destruction of the bound 
RNA by preferably, but not limited to, enzymatic or chemical (e.g., alkaline hydrolysis) 
degradation. The library beads bound to the sohd phase can then be eluted with any 
solution that dismpts the binding between the labeled target RNA and the solid phase. Such 

15 solutions include high salt solutions, low pH solutions, detergents, and chaotropic 

denaturants, and are well known to one of skill in the art. In another embodiment, the 
compounds can be eluted from the sohd phase by heat. 

In one embodiment, the library of compounds can be prepared on magnetic 
beads, such as Dynabeads Strq)tavidin (Dynal Biotech, Oslo, Norway). The magnetic bead 

20 library can then be mixed with the labeled target RNA under conditions that allow binding 
to occur. The separation of the beads from unbound target RNA in the liquid phase can be 
accompUshed using a magnet. After removal of the magnetic field, the bead complexed to 
the labeled RNA may be separated from uncomplexed library beads via the label used on 
the target RNA; e.g., biotinylated target RNA can be captured by avidin/streptavidin; target 

25 RNA labeled with antigen can be captured by the appropriate antibody; target RNA labeled 
with ligand can be captured using the appropriate immobilized receptor. The captured 
library bead can then be eluted with any solution that dismpts the binding between the 
labeled target RNA and the immobilized surface. Such solutions include high salt 
solutions, low pH solutions, detergents, and chaotropic denaturants, and are well knovm to 

30 one of skill in the art. Complexed RNA may or may not be irreversibly bound to the bead 
by a further transformation between the bound RNA and an additional moiety on the 
surface of the bead. Such linking methods include, but are not linrited to: photochemical 
crosslinking between RNA and bead-bound molecules such as psoralen, thymidine or 
uridine derivates either present as monomers, oligomers, or as a partially complementary 
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sequence; or chemical ligation by disulfide exchange, nitrogen mustards, bond formation 
between an electrophile and a nucleophile, or alkylating reagents. See, e.g., Intemational 
Patent Publication WO/0146461, the contents of which are hereby incorporated by 
reference. If the RNA is irreversibly bound to the bead, compounds can be isolated fi*om 
5 the bead following destruction of the bound RNA by enzymatic degradation including, but 
not limited to, ribonucleases A, U2, CL3, Ti, Phy M, B. cereus or chemical degradation 
mcluding, but not limited to, piperidine-promoted backbone cleavage of abasic sites 
(following treatment with sodium hydroxide, hydrazine, piperidine formate, or dimethyl 
sulfate), or metal-assisted (e.g. nickel(II), cobalt(II), or iron(II)) oxidative cleavage. 

10 In another embodiment, the preselected target RNA can be labeled with a 

heavy metal tag and incubated with the library beads to allow binding of the compounds to 
the target RNA. The separation of the labeled beads firom unlabeled beads can be 
accompUshed using a magnetic field. After removal of the magnetic field, the compound 
can be eluted with any solution that disrupts the bindiug between the preselected target 

15 RNA and the compound. Such solutions include high salt solutions, low pH solutions, 
detergents, and chaotropic denaturants, and are well known to one of skill in the art. In 
another embodiment, the compounds can be eluted firom the solid phase by heat. 



5.6.9 Nanoparticle Aggregation 

20 In one embodiment of the present invention, bofii the target nucleic acid and 

the compounds are labeled with nanoparticles. A nanoparticle is a cluster of ions with 
controlled size from 0.1 to 1000 nm comprised of metals, metal oxides, or semiconductors 
including, but not limited to Ag2S, ZnS, CdS, CdTe, Au, or TiOa. Methods for the 
attachment of nucleic acids and small molecules to nanoparticles are well know to one of 

25 skill in the art (reviewed in Niemeyer, 2001, Angew. Chem. hit. Ed. 40:4129-4158. The 
references cited therein are hereby incorporated by reference in their entireties). In 
particular, if multiple copies of the target nucleic acid are attached to a single nanoparticle 
and multiple copies of a compound are attached to another nanoparticle, then interaction 
between the compound and target nucleic acid will induce aggregation of nanoparticles as 

30 described, for example, by Mitchel et al 1999, J. Am. Chem. Soc. 121 :8122-8123. The 
aggregate can be detected by changes in absorbance or fluoresc^ce spectra and physically 
separated firom the unbound components through filtration or centrifiigation. 

5.6.10 Flow Cytometry 
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In a preferred embodiment, the complexed and non-complexed target nucleic 
acids are separated by flow cytometry methods. Flow cytometers for sorting and examining 
biological cells are well known in the art; this technology can be applied to separate the 
labeled Ubrary beads from milabeled beads. Known flow cytometers are described, for 
5 example, in U.S. Patent Nos, 4,347,935; 5,464,581; 5,483,469; 5,602,039; 5,643,796; and 
6,21 1,477; the entire contents of which are incorporated by reference herein. Other known 
flow cytometers are the FACS Vantage™ system manufactured by Becton Dickinson and 
Company, and the COP AS™ system manufactured by Union Biometrica. 

A flow cytometer typically includes a sample reservoir for receiving a 

10 biological sample. The biological sample contains particles (hereinafter referred to as 
"beads") that are to be analyzed and sorted by the flow cytometer. Beads are transported 
from the sample reservoir at high speed (>100 beads/second) to a flow cell in a stream of 
hquid "sheath" fluid. High-frequency vibrations of a nozzle that directs the stream to the 
flow cell causes the stream to partition and form ordered droplets, with each droplet 

15 containing a single bead. Physical properties of beads can be measured as they intersect a 
laser beam within the cytometer flow cell. As beads move one by one througji the 
interrogation point, they cause the laser Ught to scatter and fluorescent molecules on the 
labeled beads (i.e., beads complexed with labeled target RNA) become excited. 
Alternatively, if the target nucleic acid is labeled with an inorganic nanoparticle, the beads 

20 complexed with bound target nucleic acid can be distinguished not only by unique 

fluorescent properties but also on the basis of spectrometric properties (e.g, including but 
not limited to increased optical density due to the reduction of Ag"*^ ions in the presence of 
gold nanoparticles (see, e.g., Taton etal Science 2000, 289: 1757-1760)). 

5.6.11 Manual Batch 

25 In one embodiment, a for separating complexed beads. To e:^lore a 

bead-based library within a reasonable time period, the primary screens should be operated 
with sufficient througihput. To do this, the target nucleic acid is labeled with a dye and then 
incubated with the combinatorial library. An advantage of such an assay is the fast 
identification of active Hbrary beads by color change. In the lower concentrations of the 

30 dye-labeled target molecule, only those library beads that bind the target molecules most 
tightly are detected because of higher local concentration of the dye. When washed and 
plated into a liquid monolayer, colored beads are easily separated from non-colored beads 
with the aid of a dissecting microscope. One of the problems associated with this method 
could be the interaction between the red dye and library substrates. Control experiments 
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using the dye alone and dye attached to mutant RNA sequences with the libraries are 
perfonned to eliminate this possibility. 



5.6.12 Snspension of Beads in Electric Fields 

5 In another embodiment of the invention, library beads bound to the target 

KNA can be separated from unbound beads on the basis of the altered charge properties due 
to RNA binding. In a preferred embodiment of this technique, beads are separated from 
unbound nucleic acid and suspended, preferably but not only, in the presence of an electric 
field where the bound RNA causes the beads bound to the target RNA to migrate toward the 

10 anode, or positive, end of the field. 

Beads can be preferentially suspended ia solution as a coUoidal suspension 
with the aid of detergents or surfactants. Typical detergents useful in the methods of the 
present invention include, but are not limited to, anionic detergents, such as salts of 
deoxycholic acid, 1-heptanesulfonic acid, N-laurylsarcosine, lauryl sulfate, 1 -octane 

15 sulfonic acid, carboxymethylceUulose, carrageenan, and taurocholic acid; cationic 

detergents such as benzalkoniiim chloride, cetylpyridinium, methylbenzethonium chloride, 
and decamethonium bromide; zwitterionic detergents such as CHAPS, CHAPSO, alkyl 
betaines, aDcy amidoalkyl betaines, 

N-dodecyl-N,N-dimethyl-3-ammonio-l-propanesulfonate, and phosphatidylcholine; and 
20 non-ionic detergents such as n-decyl a-D-glucopyranoside, n-decyl-D-maltopyf anoside, 
n-dodecyl -D-maltoside, n-octyl -D-glucopyranoside, soibitan esters, n-tetradecyl 
-D-maltoside and tritons. Preferably, the detergent, if present, is a nonionic detergent 
Typical surfactants useful in the methods of the present invention include, but are not 
limited to, ammonium lauryl sulfate, polyethylene glycols, butyl glucoside, decyl glucoside, 
25 Polysorbate 80, lauric acid, myristic acid, palmitic acid, potassium palmitate, undecanoic 
acid, lauryl betaiae, and lauryl alcohol. 

Complexed RNA may or may not be irreversibly boimd to the bead by a 
further transformation between the bound RNA and an additional moiety on the surface of 
the bead. Such linking methods include, but are not limited to: photochemical crosslinking 
30 between RNA and bead-bound molecules such as psoralen, thymidine or uridine derivates 
either present as monomers, oligomers, or as a partially complementary sequence; or 
chemical ligation by disulfide exchange, nitrogen mustards, bond formation between an 
electrophile and a nucleophile, or alkylating reagents. 
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If the RNA is irreversibly bound to the bead, compounds can be isolated 
from the bead following destruction of the bound RNA by enzymatic degradation including, 
but not limited to, ribonucleases A, U2, CL3, Ti, Phy M, B. cereus or chemical degradation 
including, but not limited to, piperidine-promoted backbone cleavage of abasic sites 
5 (following treatment with sodium hydroxide, hydrazine, piperidine formate, or dimethyl 
sulfate), or metal-assisted {e.g. nickel(I]5> cobaltODO, or iron(II)) oxidative cleavage. 

5.6.13 Microwave Spectroscopy 
In another embodiment, the complexed beads are separated from 

10 xmcomplexed beads by microwave spectroscopy. For example, as described in U.S. Patent 
Nos. 6,395,480; 6,376,258; 6,368,795; 6,340,568; 6,338,968; 6,287,874; and 6,287,776 to 
Hefti, the disclosures of which are hereby incorporated by reference, the unique dielectric 
properties of molecules and binding complexes, such as hybridization complexes formed 
between a nucleic acid probe and a nucleic acid target, molecular binding events, and 

15 protein/ligand complexes, result in varying microwave spectra which can be measured. The 
molecule's dielectric properties can be observed by coupling a test signal to the molecule 
and observing the resultiug signal. When the test signal excites tihie molecule at a frequency 
within the molecule's dispersion regime, especially at a resonant frequency, the molecule 
will interact strongly with the signal, and the resulting signal will exhibit dramatic 

20 variations in its measured amplitude and phase, thereby generating a unique signal 

response. This response can be used to detect and identify the bound molecular structure- 
In addition, because most molecules will exhibit different dispersion properties over the 
same or different frequency bands, each generates a unique signal response which can be 
used to identify the molecular structure. 

25 

5.7 Methods for Identifymg or Characterizing the 
Compounds Bound to the Target Nucleic Acids 

If the library comprises arrays or microarrays of compounds, wherein each 
compound has an address or identifier, the compound can be deconvoluted, e,g,, by cross- 
30 referencmg the positive sample to original compound list that was applied to the individual 
test assays. 

If the library is a peptide or nucleic acid library, the sequence of the 
compound can be determined by direct sequencing of the peptide or nucleic acid. Such 
methods are weU known to one of skill in the art. 
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A number of physico-chemical teclimques can be used for the de novo 

characterization of compounds bound to the target. 



5-7.1 Mass Spectrometry 

Mass spectrometry (e,g,, electrospray ionization ("ESI") and matrix-assisted 
5 laser desorption-ionization ('MALDr*), Fourier-transform ion cyclotron resonance (*TT- 
ICR")) can be used both for high-throughput screening of compounds that bind to a target 
RNA and elucidating the structure of the compound. Thus, one example of mass 
spectroscopy is that separation of a bound and unbound complex and compound structure 
elucidation can be carried out in a single step. 
10 MALDI uses a pulsed laser for desorption of the ions and a time-of-flight 

analyzer, and has been used for the detection of noncovalent tKNA:amino-acyl-tiRNA 
synthetase complexes (Gruic-Sovulj et aL, 1997, J. BioL C3iem. 272:32084-32091). 
However, covalent cross-linking between the target nucleic acid and the compound is 
required for detection, since a non-covalently bound complex may dissociate during the 
1 5 MALDI process. 

ESI mass spectrometry ^ESI-MS") has been of greater utility for studying 
non-covalent molecular interactions because, unlike the MALDI process, ESI-MS generates 
molecular ions with little to no fragmentation (Xavier et al, 2000, Trends Biotechnol, 
1 8(8):349-356). ESI-MS has been used to study the complexes formed by HIV Tat peptide 
20 and protem with the TAR RNA (Sannes-Lowery et al, 1997, Anal. Chem. 69:5130-5135). 

Fourier-transform ion cyclotron resonance ("FT-ICR") mass spectrometry 
provides high-resolution spectra, isotope-resolved precursor ion selection, and accurate 
mass assignments (Xavier et al^ 2000, Trends Biotechnol. 1 8(8):349-356). FT-ICR has 
been used to study the interaction of aminoglycoside antibiotics with cognate and non- 
25 cognate RNAs (Hofetadler et a/., 1999, Anal. Chem. 71 :3436-3440; Griffey et a/., 1999, 
Proc. Nafl. Acad. Sci. USA 96:10129-10133). As true for all of the mass spectrometry 
methods discussed herein, FT-ICR does not require labeling of the target RNA or a 
compound. 

An advantage of mass spectroscopy is not only the elucidation of the 
30 structure of the compound, but also the determination of the structure of the compoimd 
bound to the preselected target RNA. Such information can enable the discovery of a 
consensus structure of a compound that specifically binds to a preselected target RNA. 
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In a specific embodiment, the structure of the compound is determined by 
time of flight mass spectroscopy ("TOF-MS"). In time of flight methods of mass 
spectrometry, charged (ionized) molecules are produced in a vacuum and accelerated by an 
electric field into a time of flight tube or drift tube. The velocity to which the molecules 
5 may be accelerated is proportional to the accelerating potential, proportional to the charge 
of the molecule, and inversely proportional to the square of the mass of the molecule. The 
charged molecules travel, z.e., "drift** down the TOF tube to a detector. The time taken for 
the molecules to travel down the tube may be interpreted as a measure of their molecular 
weight Time-of-fli^t mass spectrometers have been developed for all of tiie major 
10 ionization techniques such as, but limited to, electron impact C^EF'), infrared laser 
desoiption CTRLD'*), plasma desorption ('TD'O, fast atom bombardment ('TAB"), 
secondary ion mass spectrometry ("SIMS"), matrix-assisted laser desorption/ionization 
CMALDF'), and electrospray ionization ("ESrO- 

5.7.2 Edman Degradation 

15 In an embodiment wherein the library is a peptide library or a derivative 

thereof, Edman degradation can be used to determine the structure of the compound. In one 
embodiment, a modified Edman degradation process is used to obtain compositional tags 
for proteins, which is described in U.S. Patent No. 6,277,644 to Famsworfh et aL, which is 
hereby incorporated by reference in its entirety. The Edman degradation chemistry is 

20 separated fix>m amino acid analysis, circumventing the serial requirement of the 

conventional Edman process. Multiple cycles of coupling and cleavage are performed prior 
to extraction and compositional analysis of amino acids. The amino acid composition 
information is then used to search a database of known protein or DNA sequences to 
identify the sample protein. An apparatus for performing this method comprises a sample 

25 holder for holding the sample, a coupling agent supplier for supplying at least one coupling 
agent, a cleavage agent supplier for supplying a cleavage agent, a controller for directing 
the sequential supply of ttie coupling agents, cleavage agents, and other reagents necessary 
for performing the modified Edman degradation reactions, and an analyzer for analyzing 
amino acids. 

30 In another embodiment, the method can be automated as described in U.S. 

Patent No. 5,565,171 to Dovichi et al^ which is hereby incorporated by reference in its 
entirety. The apparatus includes a contuiuous capillary connected between two valves that 
control fluid flow in the capillary. One part of the capillary forms a reaction chamber where 
the sample may be immobilized for subsequent reaction with reagents supplied through the 
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valves. Another part of the capillary passes through or terminates in the detector portion of 
an analyzer such as an electrophoresis apparatus, liquid chromatographic apparatus or mass 
spectrometer. The apparatus may form a peptide or protein sequencer for cairying out the 
Edman degradation reaction and analyzing the reaction product produced by the reaction. 
5 The protein or peptide sequencer includes a reaction chamber for carrying out coupling and 
cleavage on a peptide or protein to produce derivatized amino acid residue, a conversion 
chamber for carrying out conversion and producing a converted amino acid residue and an 
analyzer for identifying the converted amino acid residue. The reaction chamber may be 
contained within one arm of a capillary and the conversion chamber is located in anoth^ 
10 arm of the capillary. An electrophoresis length of capillary is directly C25)illary coupled to 
the conversion chamber to allow electrophoresis separation of the converted amino acid 
residue as it leaves the conversion chamber. Identification of the converted amino acid 
residue takes place at one end of the electrophoresis length of the capillary. 

5.7,3 NMR Spectroscopy 

15 As described above, NMR spectroscopy is a technique for identifying 

binding sites in target nucleic acids by quaHtatively determining changes in chemical shift, 
specifically &om distances measured using relaxation effects. Examples of NMR that can 
be used for the invention include, but are not limited to, one-dimentional NMR, two- 
dimentional NMR, correlation spectroscopy ("COSY"), and nuclear Overhauser effect 

20 (**NOE*') spectroscopy. Such methods ofstructure determination of compounds are well 
known to one of skill in the art. 

Similar to mass spectroscopy, an advantage of NMR is the not only the 
elucidation of the structure of the compound, but also the determination of the structure of 
the compoxmd bound to the preselected target RNA. Such information can enable the 

25 discovery of a consensus structure of a compound that specifically binds to a preselected 
target RNA. 

5.7.4 Vibrational Spectroscopy 

Vibrational spectroscopy (e.g. infrared (IR) spectroscopy or Raman 
spectroscopy) can be used for elucidating the structure of the cono^pound on the isolated 
30 bead. 

Infrared spectroscopy measures the frequencies of infrared light 
(wavelengths from 100 to 10,000 nm) absorbed by the compound as a result of excitation of 
vibrational modes according to quantum mechanical selection rules which require that 

69 




wo 2004/010106 PCT/US2003/023075 
absorption of light cause a change in the electric dipole moment of the molecule. The 
infrared spectrum of any molecule is a unique pattem of absorption wavelengths of varying 
intensity that can be considered as a molecular fingerprint to identify any compound. 

Lifrared spectra can be measured in a scanning mode by measuring the 
5 absorption of individual frequencies of light, produced by a grating which separates 
frequeacies from a mixed-frequency infrared light source, by the compound relative to a 
standard intensity (double-beam instrument) or pre-measured ('blank') intensity 
(single-beam instrument). In a preferred embodiment, infrared spectra are measured in a 
pulsed mode (FT-IR) where a mixed beam, produced by an interferometer, of all infrared 

1 0 ligiht frequencies is passed through or reflected off the compound. The resulting 

interferogram, which may or may not be added with the resulting interferograms from 
subsequent pulses to increase the signal strength while averaging random noise in the 
electronic signal, is mathematically transformed into a spectrum using Fourier Transform or 
Fast Fourier Transform algorithms. 

15 Raman spectroscopy measures the difference in frequency due to absorption 

of infrared frequencies of scattered visible or ultraviolet light relative to the incident beam. 
The incident monochromatic light beam, usually a single laser frequency, is not truly 
absorbed by the compound but interacts with the electric field transiently. Most of the hght 
scattered off the sample with be unchanged (Rayleigh scattering) but a portion of the scatter 

20 Ught will have frequencies that are the sum or difference of the incident and molecular 
vibrational frequencies. The selection rules for Raman (inelastic) scattering require a 
change in polarizability of the molecule. While some vibrational transitions are observable 
in both infrared and Raman spectrometry, must are observable only with one or the other 
technique. The Raman spectrum of any molecule is a unique pattem of absorption 

25 wavelengths of varying intensity that can be considered as a molecular fingerprint to 
identify any compound. 

Raman spectra are measured by submitting monochromatic light to the 
sample, either passed through or preferably reflected off, filtering the Rayleigh scattered 
light, and detecting the frequency of the Raman scattered light. An improved Raman 

30 spectrometer is described in US Patent No. 5,786,893 to Fink et al, which is h^eby 
incorporated by reference. 

Vibrational microscopy can be measured in a spatially resolved fashion to 
address single beads by integration of a visible microscope and spectrometer. A 
microscopic infrared spectrometer is described in U.S. Patent No. 5,581,085 to Refber et 
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al , which is hereby incorporated by reference in its entirety. An instrument that 
simultaneously performs a microscopic infrared and microscopic Raman analysis on a 
sample is described in U.S. Patent No. 5,841,139 to Sostek et aL, which is hereby 
incorporated by reference in its entirety. 
5 In one embodiment of the method, compounds are synthesized on 

polystyrene beads doped with chemically modified styrene monomers such that each 
resulting bead has a characteristic pattem of absorption lines in the vibrational (IR or 
Raman) spectrum, by methods including but not limited to those described by Fenniri et al, 
2000, J. Am. Chem. Soc. 123:8151-8152. Using metiiods of split-pool synthesis familiar to 

10 one of skill in the art, the library of compounds is prepared so that the spectroscopic pattem 
of the bead identifies one of the components of the compound on the bead. Beads that have 
been separated according to their ability to bind target RNA can be identified by their 
vibrational spectrum. In one embodiment of the method, appropriate sorting and binning of 
the beads during synthesis then allows identification of one or more further components of 

15 the compound on any one bead. In another embodiment of the method, partial 

identification of the compound on a bead is possible through use of the spectroscopic 
pattem of the bead with or without the aid of further sorting during synthesis, followed by 
partial resynthesis of the possible compoimds aided by doped beads and appropriate sorting 
during synthesis. 

20 In another embodiment, the IR or Raman spectra of compounds are 

examined while the compound is still on a bead, preferably, or after cleavage from bead, 
using methods including but not limited to photochemical, acid, or heat treatm^L The 
compound can be identified by comparison of the IR or Raman spectral pattem to spectra 
previously acquired for each compound in the combinatorial library. 

25 In a specific embodiment, compounds can be identified by matching the IR 

or Raman spectra of a compound to a dataset of vibrational (IR or Raman) spectra 
previously acquired for each compound in the combinatorial library. By this method, the 
spectra of compounds with known stmcture are recorded so that comparison with these 
spectra can identify compounds again when isolated from RNA binding experiments. 

30 5.7.5 Microwave Spectroscopv 

In another embodiment, the microwave spectra of a compound can be used 
to elucidate the stracture of the compound. For example, as described in U.S. Patent Nos. 
6,395,480; 6,376,258; 6,368,795; 6,340,568; 6,338,968; 6,287,874; and 6,287,776 to Heftt, 
the disclosures of which are hereby incorporated by reference, the unique dielectric 
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properties of molecules and binding complexes, such as hybridization complexes formed 
between a nucleic acid probe and a nucleic acid target, molecular binding events, and 
protein/ligand complexes, result in varying microwave spectra which can be measured. The 
molecule's dielectric properties can be observed by coupUng a test signal to the molecule 
5 and observing the resulting signal. When the test signal excites the molecule at a frequency 
within the molecule's dispersion regime, especially at a resonant frequency, the molecule 
will interact strongly with the signal, and the resulting signal will exhibit dramatic 
variations in its measured amplitude and phase, thereby generating a unique signal 
response. This response can be used to detect and identify the bound molecular structure. 
10 In addition, because most molecules will exhibit different disperdon properties over the 
same or different frequency bands, each generates a unique signal response which can be 
used to identify the molecular structure. 

5.7.6 X-Rav Crvstallographv 
X-ray crystallography can be used to elucidate the structure of a compound. 

15 For a review of x-ray crystallography see, Blundell et al. 2002, Nat Rev Drag Discov 
l(l):45-54. The first step in x-ray crystallography is the formation of crystals. The 
formation of cxystals begins with the preparation of highly purified and soluble samples. 
The conditions for crystallization is then determined by optimizing several solution 
variables known to induce nucleation, such as pH, ionic strength, tenq)erature, and specific 

20 concentrations of organic additives, salts and detergent. Techniques for automating the 
crystallization process have been developed to automate the production of high-quaUty 
protein crystals. Once crystals have been formed, the crystals are harvested and prepared 
for data collection. The crystals are then analyzed by diffraction (such as multi-circle 
diffractometers, high-speed CCD detectors, and detector off-set). Generally, multiple 

25 crystals must be screened for structure determinations. 

A nimiber of methods can be used to acquire a diffraction patter so that a 
compound can be characterized. In one embodiment, an X-ray source is provided, for 
example, by a rotating anode generator producing an X-ray beam of a characteristic 
wavelength. There are a number of sources of X-ray radiation that may be used in the 

30 methods of the invention, including low and high intensity radiation. In one example, the 
tunable X-ray radiation is produced by a Synchrotron. In anoflier embodiment, the primary 
X-ray beam is monochromated by either crystal monochron:iators or focusing mirrors and 
the beam is passed through a helium flushed collimator. In a preferred embodiment, the 
crystal is mounted on a pin on a goniometer head, that is mounted to a goniometer which 
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allows to position the crystal in different orientations in the beam. The diffracted X-rays can 
be recorded using a number of techniques, including, but not limited to image plates, 
multiwire detectors or CCD cameras. In other embodiments, flash cooling, for example, of 
protein crystals, to cryogenic temperatures (-100 K) offers many advantages, the most 
5 significant of which is the elimination of radiation damage. 

5.8 Naturally Occurring Genes with Premature Stop Codons: 
Examples of Disorders and Diseases 

The invention provides for naturally occurring genes with premature stop 
codons to ascertain the effects of compounds on premature translation termination and/or 

10 nonsense-mediated mRNA decay. In general, the expression of the gene product, in 
particular, a fiill-length gene product, is indicative of the effect of the compounds on 
premature translation termination and/or nonsense-mediated mRNA decay. 

In a preferred embodiment, the naturally occurring genes with premature 
stop codons are genes that cause diseases which are due, in part, to the lack of expression of 

15 the gene resulting from the premature stop codon. Such diseases include, but are not 
Umited to, cystic fibrosis, muscular dystrophy, heart disease (e.g^., familial 
hypercholesterolemia), p53-associated cancers {e.g., lung, breast, colon, pancreatic, non- 
Hodgkin's lymphoma, ovarian, and esophageal cancer), colorectal carcinomas, 
neurofibromatosis, retinoblastoma, Wilm's tumor, retinitis pigmentosa, collagen disorders 

20 {e.g., osteogenesis imperfecta and cirrhosis), Tay Sachs disease, blood disorders {e.g. , 
hemophilia, von Willebrand disease, b-Thalassemia), kidney stones, ataxia-telangiectasia, 
lysosomal storage diseases, and tuberous sclerosis. Genes involved in the etiology of these 
diseases are discussed below. 

The recognition of translation tennination signals is not necessarily limited 

25 to a simple trinucleotide stop codon, but is instead recognized by the sequences surrounding 
the stop codon in addition to the stop codon itself (see, e,g,, ManuvaMiova et aL, 2000, 
RNA 6(7): 1044-1 055, which is hereby incorporated by reference in its entirety). Thus, any 
genes containing particular tetranucleotide sequences at the stop codon, such as, but not 
limited to, UGAC, UAGU, UAGC, UAGG, UAAC, UAAU, UAAG, and UAAA, are 

30 candidates of naturally occurring genes with premature stop codons that are useful in the 
present invention. Human disease genes that contain these particular sequence motifs are 
sorted by chromosome is presented as an Example in Section 8. 

5.8.1 Cystic Fibrosis 
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Cystic fibrosis is caused by mutations ia the cystic fibrosis conductance 



regulator ("CFTR") gene. Such mutations vary between populations and depend on a 
multitude of factors such as, but not limited to, ethnic background and geographic location. 
Nonsense mutations in the CFTR gene are expected to produce little or not CFTR chloride 
5 channels. Several nonsense mutations in the CFTR gene have been identified (see, e.g., 
Tzetis et al, 2001, Hum Genet. 109(6):592-601. Strandvik et d., 2001, Genet Test. 
5(3):23S-42; Feldmami et al., 2001, Hum Mutat. 17(4):3S6; Wilschanski et al., 2000, Am J 
Respir Grit Care Med. 161(3 Pt l):860-5; Castaldo et al, 1S99, Hum Mutat. 14(3):272; 
Mittre et al., 1999, Hum Mutat. 14(2):182; Mickle et al, 1998, Hum Mol Genet. 

10 7(4):729-35; Casals et al, 1997, Hum Genet. 101(3):365-70; Mittre et al, 1996, Hum 
Mutat. 8(4):392-3; Bonizzato et al., 1995, Hum Genet. 1995 Apr;95(4):397-402; Greil et 
al, 1995, WienKlin WocheDSclir. 107(15):464-9; Zielenski etal, 1995, Hum Mutat 
5(l):43-7; Dork et al, 1994, Hum Genet. 94(5):533-42; Balassopoulou et al, 1994, Hum 
Mol Genet. 3(10):1887-8; Ghanem et al, 1994, 21(2):434-6; Will et al, J Clin Invest. 1994 

15 Apr;93(4): 1852-9; Hull et al, 1994, Genomics. 1994 Jan 15;19(2):362-4; Dork et al, 1994, 
Hum Genet. 93(l):67-73; Rolfini & Cabrini, 1993, J Clin Invest. 92(6):2683-7; Will et al, 
1993, J Med Genet 30(10):833-7; Bienvenu et al, 1993, J Med Genet. 30(7):621-2; 
Cheadle et al, 1993, Hum Mol Genet 2(7):1067-8; Casals et al, 1993, Hum Genet 
91(l):66-70; Reiss et al, 1993, Hum Genet 91(l):78-9; ChevaUer-Porst et al, 1992, Hum 

20 Mol Genet l(8);647-8; Hamosh et al, 1992, Hum Mol Genet. l(7):542-4; Gasparini et al., 
1992, J Med Genet. 29(8):558-62; Fanen et al, 1992, Genoinics. 13(3):770-6; Jones et al., 
1992, Hum Mol Genet l(l):ll-7; Ronchetto et al, 1992, Genomics. 12(2):417-8.; Macek 
et al, 1992, Hum Mutat l(6):501-2; Shoshani et al, 1992, Am J Hum Genet. 50(l):222-8; 
Schloesser et al, 1991, J Med Genet. 28(12):878-80; Hamosh et al., 1991, J Clin Invest 

25 88(6):1880-5; Bal et al., 1991, J Med Genet. 28(10):715-7; Doric et al, 1991, Hum Genet 
87(4):441-6; Beaudet et al, 1991, Am J Hum Genet. 48(6): 1213; Gasparini et al, 1991, 
Genomics. 10(l):193-200; Cutting etal, 1990, N Engl J Med. 1990, 323(24): 1685-9; and 
Kerem et al, 1990, Proc Nati Acad Sci USA. 87(21):8447-51, ttie disclosures of which are 
hereby incorporated by reference in their entireties). Any CFTR gene encoding a premature 

30 translation codon including, but not limited to, the nonsense mutations described in the 
references cited above, can be used in the present invention to identify compounds that 
mediate premature translation termination and/or nonsens&-mediated mRNA decay. 



5.8^ Muscular Dystrophy 
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Muscular dystrophy is a genetic disease characterized by severe, progressive 
muscle wasting and weakness. Duchenne muscular dystrophy and Becker muscular 
dystrophy are generally caiised by nonsense mutations of the dystrophin gene (see, e.g., 
Kerr et al., 2001, Hum Genet 109(4):402-7 and Wagner et al., 2001, Ann Neurol. 
5 49(6):706-l 1). Nonsense mutations in other genes have also been implicated in other types 
of muscular dystrophy, such as, but not limited to, collagen genes in Ullrich congenital 
muscular dystrophy (see, e.g., Demir et aL, 2002, Am J Hum Genet. 70(6): 1446-58), the 
^erin gene and lamins genes in Em^-Dreifuss muscular dystrophy (see, cg.^ Holt et al., 
2001, Biochem Biophys Res Commun. 287(5):1 129-33; Becane et al., 2000, Pacing Clin 

10 ElectrophysioL 23(11 R l):1661-6; and Bonnet a/., 2000, Ann Neurol. 48(2):170-80.), the 
dysferlin gene in Miyoshi myopathy (see, e.g., Nakagawa et al., 2001, J Neurol Sci. 
184(l):15-9), the plectin gene in late onset muscular dystrophy (see, e,g., Bauer et al., 2001, 
Am J Pathol. 158(2):617-25), the delta-sarcoglycan gene in recessive limb-girdle muscular 
dystrophy (see, e.g., Duggan et al., 1997, Neurogenetics. l(l);49-58), the laminina2-chain 

15 gene in congenital muscular dystrophy (see, e.g., Mendell et al., 1998, Hum Mutat. 

12(2): 135), the plectin gene in late-onset muscular dystrophy (see, e.g., Rouan et al., 2000, J 
Invest Dennatol. 1 14(2):381-7 and Kunz et al., 2000, J Invest Demiatol. 1 14(2):376-80), 
the myophosphorylase gene in McArdle's disease (see, e.g., Bruno et aL, 1999, 
Neuromuscul Disord. 9(l):34-7), and the collagen VI in Bethlem myopathy (see, e.g., 

20 Lamande et al., 1998, Hum Mol Genet. 1998 Jun;7(6):981-9). 

Several nonsense mutations in the dystrophin gene have been identified (see, 
Kerr et al., 2001, Hum Genet. 109(4):402-7; Mendell et al., 2001, Neurology 
57(4):645-50; Fajkusova et al., 2001, Neuromuscul Disord. 1 1(2):133-8; Ginjaar et al, 
2000, Eur J Hum Genet. 8(10):793-6; Lu et al, 2000, J Cell Biol. 148(5):985-96; 

25 Tuffery-Giraud et al, 1999, Hum Mutat. 14(5):359-68; Fajkusova et al, 1998, J 

Neurogenet. 12(3):183-9; Tuffery et al, 1998, Hum Genet. 102(3):334-42; Shiga et al, 
1997, J Clin Invest. 100(9):2204-10; Winnard et al, 1995, Am J Hum Genet 56(l):158-66; 
Prior et al, 1994, Am J Med Genet. 50(l):68-73; Prior et al, 1993, Hum Mol Genet. 
2(3):311-3; Prior et al, 1993, Hum Mutat. 2(3):192-5; Nigro et al., 1992, Hum Mol Genet. 

30 l(7):517-20; Worton, 1992, J hiherit Metab Dis. 15(4):539-.50; and Buhnan et al, 1991, 
Genomics. 10(2):457-60; the disclosures of which are hereby incorporated by reference in 
their entireties). Any gene encoding a premature translation codon implicated in muscular 
dystrophy including, but not limited to, the nonsense mutations described m the references 
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cited above, can be used in the present invention to identify compounds that mediate 
premature translation termination and/or nonsense-mediated mRNA decay. 

5.S3 Fanulial Hypercholesterolemia 

Hypercholesterolemia, or high blood cholesterol, results from either the 
5 overproduction or the underutilization of low density lipoprotein ("LDL"). 

Hypercholesterolemia is caused by either the genetic disease familial hypercholesterolemia 
or the consumption of a high cholesterol diet. Nonsense mutations in the LDL receptor 
gene have been impUcated in familial hypercholesterolemia. Several nonsense mutations in 
the LDL receptor gene have been identified (see, e.g,^ Lind et al^ 2002, Atherosclerosis 

10 163(2):399-407; Salazar et al, 2002, Hum Mutat 19(4):462-3; Kuhrova et al, 2002, Hum 
Mutat. 19(1):80; Zakharova et al, 2001, Bioorg Khim. 27(5):393-6; Kuhrova et al, 2001, 
Hum Mutat. 18(3):253; Genschel et al, 2001, Hum Mutat 17(4):354; Weiss et al, 2000, J 
Inherit Metab Dis. 23(8):778-90; Mozas et al, 2000, Hum Mutat 15(5):483-4; Shin et al, 
2000, Clin Genet, 57(3):225-9; Graham et al, 1999, Atherosclerosis 147(2):309-16; Hattori 

15 et al, 1999, Hum Mutat 14(1):87; Cenarro et al., 1998, Hum Mutat 1 1(5):413; Rodningen 
et al, 1999, Hum Mutat 13(3):186-96; Hirayama et al, 1998, J Hum Genet. 43(4):250-4; 
Lind et al, 1998, J Mem Med. 244(l):19-25; Thiart et al, 1997, Mol Cell Probes 
1 1(6):457-8; Maruyama et al, 1995, Arterioscler Thromb Vase Biol. 15(10):1713-8; 
Koivisto et al, 1995, Am J Hum Genet 57(4):789-97; Lombaidi et al, 1995, J Lipid Res. 

20 36(4):860-7; Leren et al, 1993, Hum Genet 92(1):6-10; Landsberger et al., 1992, Am J 
Hum Genet 50(2):427-33; Loux et al., 1992, Hum Mutat 1992;l(4):325-32; Motulsky, 
1989, Arteriosclerosis. 9(1 Suppl):B-7; Lehmian et al, 1987, J Biol Chem. 262(1):401-10; 
and Lehrman et al, 1985, Cell 41(3):735-43; the disclosures of which are hereby 
incorporated by reference in their entireties). Atiy LDL receptor gene encoding a premature 

25 translation codon including, but not lunited to, the nonsense mutations described in the 
references cited above, can be used in the preseit invention to identify compounds that 
mediate premature translation tennination and/or nonsense-mediated mRNA decay. 

5.8.4 p53"associated Cancers 

Mutant forms of the p53 protein, which is thought to act as a negative 
30 regulator of cell proliferation, transformation, and tumorigenesis, have been implicated as a 
common genetic change characteristic of human cancer (see, e.g,, Levine et al, 1991, 
Nature 351:453-456 and HoUstein et al, 1991, Science 253:49-53). p53 mutations have 
been implicated in cancers such as, but not limited to, lung cancer, breast cancer, colon 
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cancer, pancreatic cancer, non-Hodgkin's lymphoma, ovarian cancer, and esophageal 
cancer. 

Nonsense mutations have been identified in the p53 gene and have been 
implicated in cancer. Several nonsense mutations in the p53 gene have been identified (see, 
5 e^., Masuda et al, 2000, Tokai J Exp Clin Med. 25(2):69-77; Oh et aL, 2000, Mol Cells 
10(3):275-80; li et al, 2000, Lab Invest. 80(4):493-9; Yang et al, 1999, Zhonghua Zhong 
Liu Za Zhi 21(2):1 14-8; Finkelstein et at, 1998, Mol Diaga. 3(1):37-41; Kajiyama et al, 
1998, Dis Esophagus. ll(4):279-83;Kawamurae^a/., 1999,LeukRes. 23(2):115-26;Radig 
et al, 1998, Hum Pafhol. 29(ll):1310-6; Schuyer et al, 1998, Iht J Cancer 76(3):299-303; 

10 Wang-Gohrke et al, 1998, Oncol Rep. 5(l):65-8; Fulop et al, 1998, J Repirod Med. 
43(2):119-27; Ninomiya et al, 1997, J Dennatol Sd. 14(3):173-8; Hsieh et al, 1996, 
Cancer Lett. 100(1-2): 107-13; Rail etal, 1996, Pancreas. 12(l):10-7; Fukutomi et al, 
1995, Nippon Rinsho. 53(1 1):2764-8; Frebourg et al, 1995, Am J Hum Genet. 
56(3):608-15; Dove et al, 1995, Cancer Surv, 25:335-55; Adamson et al, 1995, Br J 

15 Haematol. 89(l):61-6; Grayson et al, 1994, Am J Pediatr Hematol Oncol. 16(4):341-7; 
Lepelley et al, 1994, Leukemia. 8(8):1342-9; McLityre et al, 1994, J Chn Oncol. 
12(5):925-30; Horio et al, 1994, Oncogene. 9(4):1231-5; Nakamura et al, 1992, Jpn J 
Cancer Res. 83(12): 1293-8; Davidofif et al., 1992, Oncogene. 7(l):127-33; and Ishioka et 
al, 1991, Biochem Biophys Res Commun. 177(3):901-6; the disclosures of which are 

20 hereby incorporated by reference in their entireties). Any p53 gene encoding a premature 
translation codon including, but not limited to, the nonsense mutations described in the 
references cited above, can be used in the present invention to identify compounds that 
mediate premature translation termination and/or nonsense-mediated mRNA decay. 

5.8.5 Colorectal Carcinomas 

25 Molecular genetic abnormalities resulting in colorectal carcinoma involve 

tumor-suppressor genes that undergo inactivation (such as, but not limited to, ape, mcc, dec, 
p53, and possibly genes on chromosomes 8p, Ip, and 22c|) and dominant-acting oncogenes 
(such, but not limited to, ras, src, and myc) (see, Hamilton, 1992, Cancer 70(5 
Suppl):1216-21). Nonsense mutations in the adenomatous polyposis coli ("APC") gene and 

30 mismatch repair genes (such as, but not limited to, mlhl and msh2) have also been 

described. Nonsense mutations have been implicated in colorectal carcinomas (see, e.g,, 
Viel et al, 1997, Genes Chromosomes Cancer. 18(1):8-18; Akiyama et al, 1996, Cancer 
78(12):2478-84; Itoh & Imai , 1996, Hokkaido IgakuZasshi 71(1):9-14; Kolodner et al, 
1994, Genomics. 24(3):5 16-26; Ohue et al, 1994, Cancer Res. 54(17):4798-804; and Yin et 
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al, 1993, Gastroenterology. 104(6):1633-9; the disclosures of which are hereby 
incorporated by reference in their entireties). Any gene encoding a premature translation 
codon implicated in colorectal carcinoma including, but not limited to, the nonsense 
mutations described in the references cited above, can be used in the present invention to 
5 identify compounds that mediate premature translation termination and/or nonsense- 
mediated mKNA decay. 

5.8.6 Neurofibromatosis 
Neurofibromatosis is an inherited disorder, which is coromonly caused 
caused by mutations in the NFl and NF2 tumor suppressor genes. It is characterized by 

10 multiple intracranial tumors including schwaimomas, meningiomas, and ependymomas. 
Nonsense mutations in the NFl and NF2 genes have been described. Nonsense mutations 
have been implicated in neurofibromatosis (see, e.g., Lams2xis et al^ 2001, Int J Cancer 
91(6):803-8; Sestini et al, 2000, Hum Genet. 107(4):366-71; Fukasawa et al, 2000, Jpn J 
Cancer Res. 91 (12): 1241 -9; Park et al, 2000, J Hum Genet. 45(2):84-5; Ueki et al, 1999, 

15 * Cancer Res. 59(23):5995-8; , 1999, Hokkaido Igaku Zasshi. 74(5):377.86; Buske et al, 
1999, Am J Med Genet. 86(4):328-30; Harada et al, 1999, Surg Neurol. 51(5):528-35; 
Krkljuse^a/., 1998, Hum Mutat. ll(5):411;Klosee/a/., 1999, Am J Med Genet. 
83(1):6.12; Park & Pivnick, 1998, J Med Genet. 35(10):813-20; Bahuau et al, 1998, Am J 
Med Genet. 75(3):265-72; Bijlsma et al, 1997, J Med Genet. 34(1 1):934-6; MacCollin et 

20 a/., 1996, Ann Neurol. 40(3):440-5; Upadhyaya et al, 1996, Am J Med Genet 67(4):421-3; 
Robinson et al, 1995, Hum Genet. 96(l):95-8.; Legius et al, 1995, J Med Genet. 
32(4):316-9; von Deunling et al, 1995, Brain Pathol. 5(1):1 1-4; DubUn et al, 1995, Hum 
Mutat 5(l):81-5; Legius et al, 1994, Genes Chromosomes Cancer. 10(4):250-5; Purandare 
et al, 1994, Hum Mol Genet 3(7):1 109-15; Shen & Upadhyaya, 1993, Hum Genet 

25 92(4):410-2; and Estivill et al, 1991, Hum Genet 88(2):185-8; the disclosures of which are 
hereby incorporated by reference in their entireties). Any gene encoding a premature 
translation codon implicated in neurofibromatosis including, but not limited to, the 
nonsense mutations described in the references cited above, can be used in the present 
invention to identify compounds that mediate premature translation termination and/or 

30 nonsense-mediated mRNA decay. 

S.8.7 Retinoblastoma 

The retinoblastoma gene plays important roles in the genesis of human 
cancers. Several pieces of evidence have shown that the retinoblastoma protein has dual 
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roles in gating cell cycle progression and promoting cellular differentiation (see, e.g., Lee & 
Lee, 1997, Gan To Kagaku Ryoho 24(1 1):1368-80 for a review). Nonsense mutations in 
the RBI gene have been described. Nonsense mutations have been implicated in 
retinoblastoma (see, e.g.. Klutz et al., 2002, Am J Hum Genet. 71(l):174-9; Alonso et al., 
5 2001, Hum Mutat. 17(5):412-22; Wong et al., 2000, Cancer Res. 60(21):6171-7; Harbour , 
1998, Ophthahnology 105(8): 1442-7; Fulop et al., 1998, J Reprod Med. 43(2):119-27; 
Onadim et al., 1997, Br J Cancer 76(11): 1405-9; Lohmann et al., 1997, Ophthahnologe 
94(4):263-7; CoweU & Cragg, 1996, Eur J Cancer. 32A(10):1749-52; Lohmann et al., 1996, 
Am J Hum Genet. 58(5):940-9; Sh^iro et al., 1995, Cancer Res. 55(24) :6200-9; Huang et 

10 al., 1993,CancerRes.53(8):1889-94;andCheng&Haas, 1990, Mol Cell Biol. 

10(10):5502-9; the disclosures of which are hereby incorporated by reference in then- 
entireties). Any gene encoding a premature translation codon implicated in retinoblastoma 
including, but not limited to, the nonsense mutations described in the references cited 
above, can be used in the present invention to identify compounds that mediate premature 

1 5 translation termination and/or nonsense-mediated mRNA decay. 

5.8.8 Wihn^s Tumor 
Wihn's tumor, or nephroblastoma, is an embryonal malignancy of the kidney 
that affects children. Nonsense mutations in the WTl gene have been implicated in Wilm's 
tumor. Several nonsense mutations in the WTl have been identified (see, e.g,, Nakadate et 

20 al, 1999, Genes Chromosomes Cancer 25(l):26-32; Diller et al, 1998, J Clin Oncol. 

16(1 1):3634-40; Schumacher et al, 1997, Proc Natl Acad Sci U S A. 94(8):3972-7; Coppes 
et al, 1993, Proc Natl Acad Sci USA. 90(4):1416-9; and Little et al, 1992, Proc Natl 
Acad Sci U S A. 89(1 1):4791-5; the disclosures of which are hereby incorporated by 
reference in their entireties). Any WTl gene encoding a premature translation codon 

25 including, but not limited to, the nonsense mutations described in the references cited 

above, can be used in the present invention to identify compounds that mediate premature 
translation termination and/or nonsense-mediated mRNA decay. 

5.8.9 Retinitis Pigmentosa 
Retinitis pigmentosa is a genetic disease in which affected individuals 
30 develop progressive degeneration of llie rod and cone photoreceptors. Retinitis pigmentosa 
caimot be explained by a single genetic defect but instead the hereditary aberration 
responsible for triggering the onset of the disease is localized in different genes and at 
different sites within tihese genes (reviewed in, e.g,, KoUer et aL^ 1997, KLlin Monatsbl 
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Augenheilkd 211 (2):84-93). Nonsense mutations have been implicated in retinitis 
pigmentosa (see, e.g., Ching et al, 2002, Nevirology 58(1 1):1673-4; Zhang et al, 2002, 
Zhonghua Yi Xue Yi Chuan Xue Za Zhi. 19(3):194-7; Zhang et al, 2002, Hum Mol Genet. 
1;11(9):993-1003; Dietrich et al, 2002, Br J Qphthahnol. 86(3):328-32; Grayson et al, 
5 2002, J Med Genet. 39(l):62-7; Liu et al., 2001, Zhonghua Yi Xue Za Zhi 81(2):71-2; 
Damji et al, 2001, Can J Ophthahnol. 36(5):252-9; Berson et al, 2001, Invest Ophthalmol 
Vis Sci. 42(10):2217-24; Chan et al, 2001, Br J Qphthahnol. 85(9):1046-8; Baum et al, 
2001, Hum Mutat 17(S):436; Mashima et al, 2001, Ophthahnic Genet. 22(l):43-7; 
Zwaenepoel c/ al, 2001, Hum Mutat. 2001;17(1):34-41; Bork et al., 2001, Am J Hum 

10 Genet. 68(l):26-37; Sharon et al., 2000, Invest Ophtihiahnol Vis Sci. 41(9):2712-21; Dreyer 
et al, 2000, Eur J Hum Genet. 8(7):500-6; Liu et al, 2000, Hum Mutat. 15(6):584; Wang et 
al, 1999, Exp Eye Res. 69(4):; Bowne et al, 1999, Hum Mol Genet. 8(ll):2121-8; 
Guillonneau et al, 1999, Hum Mol Genet. 8(8):1541-6; Dryja et al, 1999, Invest 
Ophflialmol Vis Sci. 40(8): 1859-65; Sullivan et al, 1999, Nat Genet. 22(3):255-9; Pierce et 

15 al, 1999, Nat Genet 22(3):248-54; Janecke et al, 1999, Hum Mutat. 13(2):133-40; Cuevas 
et al, 1998, Mol Cell Probes 12(6):417-20; Schwahn et al, 1998, Nat Genet. 19(4):327-32; ■ 
Buraczynska et al., 1997, Am J Hum Genet 61(6):1287-92; Meindl et al, 1996, Nat Genet 
13(l):35-42; Keen et al, 1996, Hum Mutat 8(4):297-303; Dryja et al, 1995, Proc Natl 
Acad Sci U S A. 92(22):10177-81; Apfelstedt-SyUa et al, 1995, Br J Ophthalmol. 

20 79(l):28-34; Bayes et al, 1995, Hum Mutat. 5(3):228-34; Shastry, 1994, Am J Med Genet 
52(4):467-74; Gal et al, 1994, Nat Genet 7(l):64-8; Sargan et al, 1994, Gene Ther. 1 
Siqjpl 1:S89; McLaughlin et al, 1993, Nat Genet. 4(2):130-4; Rosenfeld et al, 1992, Nat 
Genet. 1(3):209-13; the disclosures of which are hereby incorporated by reference in their 
entireties). Any gene encoding a premature translation codon implicated in retinitis 

25 pigmentosa including, but not limited to, the nonsense mutations described in the ref^ences 
cited above, can be used in the present invention to identify compoimds that mediate 
premature translation termination and/or nonsense-mediated mSNA decay. 

5.8.10 Osteogenesis Imperfecta 

Osteogenesis imperfecta is a heterogeneous disorder of type I collagen 
30 resulting in varjnng degrees of severity and results from mutations the genes that encode the 
proalpha chains of type I collagen. Nonsense mutations have been implicated in the genes 
that encode the proalpha chains of type I collagen ("COLAl" genes) (see, e.g., Slayton et 
al, 2000, Matrix Biol. 19(l):l-9; Bateman et al, 1999, Hum Mutat 13(4):311-7; and 
Willing et al, 1996, Am J Hum Genet 59(4):799-809; the disclosures of which are hereby 
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incorporated by reference in their entireties). Any COLAl gene encoding a premature 
translation codon including, but not limited to, the nonsense mutations described in the 
references cited above, can be used in the present invention to identify compounds that 
mediate premature translation termination and/or nonsense-mediated mRNA decay. 

5 5.8.11 Cirrhosis 

Cirrhosis generally refers to a chronic liver disease that is marked by 
replacement of normal tissue with fibrous tissue. The multidrug resistance 3 gene has been 
implicated in cirrhosis, and nonsense mutations have been identified in this gene (see, e.g., 

Jacquemin et al, 2001, Gastroenterology. 2001 May;120(6): 1448-58; the 
10 disclosure of which is hereby incorporated by reference in its entirety). Any gene involved 
in cirrhosis encoding a premature translation codon including, but not limited to, the 
nonsense mutations described ia the reference cited above, can be used in the present 
invention to identify compounds that mediate premature translation termination and/or 
nonsense-mediated noKNA decay. 

15 5.8.12 Tav Sachs Disease 

Tay Sachs disease is an autosomal recessive disorder afifecting the central 
nervous system. The disorder results firom mutations in the gene encoding the 
alpha-subimit of beta-hexosaminidase A, a lysosomal enzyme composed of alpha and beta 
polypeptides. Several nonsense mutations have been implicated in Tay Sachs disease (see, 

20 e,g., Rajavel & Neufeld, 2001, Mol Cell Biol. 21(16):55l2-9; Myerowitz, 1997, Hum 

Mutat 9(3):195-208; AkU et al, 1993, Hum Genet 90(6):614-20; Mules et al, 1992, Am J 
Hum Genet. 50(4):834-41; and Akli a/., 1991, Genomics. ll(l):124-34; the disclosures 
of which are hereby incorporated by reference in their entireties). Any hexosaminidase 
gene encoding a premature translation codon including, but not limited to, the nonsense 

25 mutations described in the references cited above, can be used in the present invention to 
identify compounds that mediate premature translation termination and/or nonsense- 
mediated mRNA decay. 

5.8.13 Blood Disorders 
Hemophilia is caused by a deficiency in blood coagulation factors. Affected 
30 individuals are at risk for spontaneous bleeding into organs and treatment usually consists 
of administration of clotting factors. Hemophilia A is caused by a deficiency of blood 
coagulation factor VUI and hemophilia B is caused by a deficiency in blood coagulation 
factor IX. Nonsense mutations in the genes encoding coagulation factors have been 
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implicated in hemophilia (see, e,g,, Dansako et aL, 2001, Ann Hematol. 80(5):292-4; 
Moller-Morlang et aL, 1999, Hum Mutat. 13(6):504; Kamiya et aL, 1998, Rinsho Ketsueki 
39(5):402-4; Freson et aL, 1998, Hum Mutat. 1 1(6):470-9; Kamiya et aL, 1995, Int J 
Hematol. 62(3):175-81; Walter et aL, 1994, Thromb Haemost. 72(l):74-7; Figueiredo, 
5 1993, Braz J MedBiol Res. 26(9):919-31; Reiner & Thompson, 1992, Hum Genet. 

89(l):88-94; Koeberl et aL, 1990, Hum Genet. 84(5):387-90; Driscoll et aL, 1989, Blood. 
74(2):737-42; Chen et aL, 1989, Am J Hum Genet 44(4):567-9; Mikami et aL, 1988, Jinrui 
Idengaku Zasshi. 33(4):409-15; Gitschier et aL, 1988, Blood 72(3): 1022-8; and Sommer et 
aL, 1987, Mayo Clin Proc. 62(5):387-404; the disclosures of which are hereby incorporated 
10 by reference in their entireties). Any gene encoding a premature translation codon 

implicated in hemophUia including, but not limited to, the nonsense mutations described in 
the references cited above, can be used in the present invention to identify compounds that 
mediate premature translation termination and/or nonsense-mediated rtiRNA decay. 

Von Willebraud disease is a single-locus disorder resulting from a deficiency 
15 of von Willebrand factor: a multimeric multifunctional protein involved in platelet adhesion 
and platelet-to-platelet cohesion in high shear stress vessels, and in protecting from 
proteolysis and directing circulating factor Vin to the site of mjury (reviewed in 
Rode^ero, 2002, HaemophiUa. 8(3):292-300). Nonsense mutations have impHcated in 
von Willehbrand disease (see, e.g., Rode^ero, 2002, Haemophilia. 8(3):292-300; Enayat 
20 et aL, 2001, Blood 98(3):674-80; Surdhar et aL, 2001, Blood 98(l):248-50; Casana et aL, 
2000, Br J Haematol. 1 1 1(2):552-5; Baronciani et al. , 2000, Thromb Haemost. 
84(4):536-40; FeUowes et aL, 2000, Blood 96(2):773-5; Waseem etaL, 1999, Thromb 
Haemost. 81(6):9d0-5; Mohlke et aL, 1999, Int J Clin Lab Res. 29(l):l-7; Rieger et aL, 
1998, Thromb Haemost. 80(2):332-7; Kenny et aL, 1998, Blood 92(l):175-83; Mazmier et 
25 aL, 1998, Ann Genet. 41(l):34-43; Hagiwara et aL, 1996, Thromb Haemost. 76(2):253-7; 
Mazurier & Meyer, 1996, Baillieres Clin Haematol. 9(2):229-41; Schneppenheim et aL, 
1994, Hum Genet. 94(6):640-52; Zhang et aL, 1994, Genomics 21(l):188-93; Ginsburg & 
Sadler, 1993, Thromb Haemost. 69(2):177-84; Eikenboom et aL, 1992, Thromb Haemost. 
68(4):448-54; Zhang et aL, 1992, Am J Hum Genet. 51(4):850-8; Zhang et aL, 1992, Hum 
30 Mol Genet. l(l):61-2; and Mancuso et aL, 1991, Biochemistry 30(l):253-69; the 

disclosures of which are hereby incorporated by reference in their entireties). Any gene 
encoding a premature translation codon implicated in von Willebrand disease including, but 
not limited to, the nonsense mutations described in the references cited above, can be used 
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in fhe present invention to identify compounds that mediate premature translation 
termination and/or nonsense-mediated mRNA decay. 

jS thalassemia is caused by a deficiency in beta globin polypeptides which in 
turn causes a deficiency in hemoglobin production. Nonsense mutations have been 
5 implicated in b thalassemia (see, e.g., El-Latif et al., 2002, Hemoglobia 26(l):33-40; 
Sanguansermsri et al., 2001, Hemoglobin 2S(l):19-27; Romao 2000, Blood 
96(8):2895-901; Perea et al., 1999, Hemoglobin 23(3):231-7; Rhodes et al., 1999, Am J 
Med Sci. 317(5):341-5; Fonseca et al., 1998, Hemoglobin 22(3): 197-207; Gasperini et oL, 
1998, Am J Hematol. 1998 Jan;57(l):43-7; Galaaello et al., 1997, Br J Haematol. 

10 99(2):433-6; Pistidda et al., 1997, Eur J Haematol. 58(5):320-5; Oner et al, 1997, Br J 
Haematol. 96(2):229-34; Yasunaga et al, 1995, Mem Med. 34(12): 1198-200; Molina et 
al, 1994, Sangre (Bare) 39(4):253-6; Chang etal, 1994, hit J Hematol. 59(4):267-72; 
Gihnan et al, 1994, Am J Hematol. 45(3):265-7; Chan et al, 1993, Prenat Diagn. 
13(10):977-82; George et al, 1993, Med J Malaysia 48(3):325-9; Divoky et al, 1993, Br J 

15 Haematol. 83(3):523-4; Fioretti et al, 1993, Hemoglobin 17(1):9-17; Rosatelli et al, 1992, 
Am J Hum Genet 50(2):422-6; Moi et al, 1992, Blood 79(2):512-6; Loudianos et al, 
1992, Hemoglobin 16(6):503-9; Fukumaki, 1991, Rinsho Ketsueki 32(6):587-91; Cao et al, 
1991, Am J Pediatr Hematol Oncol. 13(2):179-88; Galanello et al., 1990, Clin Geaet 
38(5):327-31; Lin, 1990, Zhongguo Yi Xue Ke Xue Yuan Xue Bao 12(2):90-5; 

20 AuldUa-Scholz et al, 1990, Hum Genet 84(2):195-7; Cao et al, 1990, AnnN Y Acad Sci. 
612:215-25; Sanguansermsri et al, 1990, Hemoglobin 14(2):157-68; Galanello e/ a/., 1989, 
Blood 74(2):823-7; RosateUi et al., 1989, Blood 73(2):601-5; GalaaeUo et al, 1989, Prog 
Clin Biol Res. 316B:113-21; Galanello etal., 1988, Am J Hematol. 29(2):63-6; Chanel a/., 
1988, Blood 72(4):1420-3; Atweh et al, 1988, J Clin hivest 82(2):557-61; Masala et al, 

25 1988, Hemoglobin 12(5-6):661-71; Pirastu et al, \9%1, Proc Natl Acad Sci U S A 
84(9):2882-5; Kazazian et al, 1986, Am J Hum Genet. 38(6):860-7; Cao et al, 1986, 
Prenat Diagn. 6(3):159-67; Cao etal, 1985, AnnN Y Acad Sci. 1985;445:380-92; Pirastu 
et al, 1984, Science 223(4639):929-30; Pirastu et al, 1983, N Engl J Med. 309(5):284-7; 
Trecartm et al, 1981, J Clin hivest 68(4):1012-7; and liebhaber et al, 1981, Trans Assoc 

30 Am Physicians 94:88-96; the disclosures of which are hereby incorporated by reference in 
their ^tireties). Any gene encoding a premature translation codon inqplicated in b 
thalassemia including, but not limited to, the nonsense mutations described in the references 
dted above, can be used in the present invention to identify compounds that mediate 
premature translation termination and/or nonsense-mediated mRNA decay. 
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5.8.14 Kidney Stones 

Kidney stones (nephrolithiasis), which affect 12% of males and 5% of 

females in the western world, are familial in 45% of patients and are most commonly 

associated with hypercalciuria (see, e.g., Lloyd et aL, Nature 1996 Feb l;379(6564):445-9). 

5 Mutations of the renal-specific chloride channel gene are associated with hjpercalciuric 

nephrolithiasis (kidney stones). Nonsense mutations have been impUcated in kidney stones 

(see, e,g., Hoopes et al, 1998, Kidney Int 54(3):698-705; Lloyd et al, 1997, Hum Mol 

Genet. 6(8):1233-9; Lloyd et al, 1996, Nature 379(6564):445-9; and Pras et al, 1995, Am J 

Hum Genet. 56(6):1297-303; the disclosures of which are hereby incorporated by reference 

10 in their entireties). Any gene encoding a premature translation codon implicated in kidney 

stones including, but not limited to, the nonsense mutations described in the references cited 

above, can be used in the present invention to identify compounds that mediate premature 

translation termination and/or nonsense-mediated mRNA decay. 

5.8.15 Ataxia-Telangiectasia 
Ataxia-telangiectasia is characterized by increased sensitivity to ionizing 
radiation, increased incidence of cancer, and neurodegeneration and is generally caused by 
mutations in the ataxia-telangiectasia gene (see, e.g.. Barlow et al, 1999, Proc Natl Acad 
Sci USA 96(17):991 5-9). Nonsense mutations have been implicated in ataxia-telangiectasia 
(see, e.g,, Camacho et al, 2002, Blood 99(l):238-44; Pitts et al, 2001, Hum Mol Genet 
10(1 1):1 155-62; Laake et al, 2000, Hum Mutat 16(3):232-46; Li & Swift, 2000, Am J 
Med Genet. 92(3):170-7; Teraoka et al, 1999, Am J Hum Genet. 64(6):1617-31; and 
Stoppa-Lyonnet et aL, 1998, Blood 91(10):3920-6; flie disclosures of which are hereby 
incorporated by reference in their entireties). Any gene encoding a premature translation 
codon hnplicated in ataxia-telangiectasia including, but not limited to, the nonsense 
mutations described in the references cited above, can be used in the present invention to 
identify compounds that mediate premature translation termination and/or nonsense- 
mediated mKNA decay. 

5.8.16 Lysosomal Storage Diseases 

There are more than 40 individually recognized lysosomal storage disorders. 

Each disorder results from a deficiency in the activity of a specific enzyme, which impedes 

the lysosome from carrying out its normal degradative role. These include but are not 

hmited to the diseases listed subsequently. Aspartylglucosaminuria is caused by a 

deficiency of N-aspartyl-beta-glucosaminidase (Fisher et al., 1990, FEBS Lett. 269:440- 

444); cholesterol ester storage disease (Wohnan disease) is caused by mutations in the UFA 
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gene (Fujiyama et al., 1996, Hum. Mutat. 8:377-380); mutations in the CTNS gene are 

associated with cystinosis (Town et al, 1998, Nature Genet 18:319-324); mutations in a- 

galactosidase A are associated with Fabry disease (Eng et al.,1993, Pediat Res. 33:128A; 

Sakuraba et al., 1990, Am. J. Hum. Genet. 47:784-789; Davies et al., 1993, Hum. Molec. 

5 Genet 2:1051-1053; Miyamura et al., 1996, J. Clin. Invest 98:1809-1817); fiicosidosis is 

caused by mutations in the FUCAl gene(Kretz et al., 1989, J. Molec. Neurosci. 1:177-180; 

Yang et al., 1992, Biochem. Biophys. Res. Commun. 189:1063-1068; Seo et at, 1993, 

Hum. Molec. Genet 2:1205-1208); mucolipidosis type I resTilts from mutations in the 

NEUl gene (Bonten et al.,1996, Genes Dev. 10:3156-3169); mucoUpidosis type IV results 

10 from mutations in theMCOLNl gene (Bargal et al., 2000, Nature Genet 26:120-123; Sun et 
al., 2000JIum. Molec. Genet 9:2471-2478); Mucopolysaccharidosis type I (Hurler 
syndrome) is caused by mutations in the IDUA gene (Scott et at, 1992, Genomics 13:1311- 
1313;Bach et al., 1993, Am. J. Hum. Genet 53:330-338); Mucopolysaccharidosis type II 
(Hunter syndrome) is caused by mutations in the IDS gene (Sukegawa et al., 1992, 

15 Biochem. Biophys. Res. Commun. 183:809-813; Bunge et al., 1992 Hum. Molec. Genet. 
1:335-339; Flomen et al., 1992, Genomics 13:543-550); mucopolysaccharidosis type 25IIIB 
(Sanfilippo syndrome type A) is caused by mutations in the SGSH gene(Yogalingam et al., 
2001, Hum. Mutat. 18:264-281); mucopolysaccharidosis type IDB (Sanfilippo syndrome) is 
caused by mutations hi the NAGLU gene (Zhao et at, 1996, Proc. Nat Acad. Sci. 93:6101- 

20 6105; Zhao et al., 1995, Am. J. Hum. Genet 57:A185); mucopolysaccharidosis type TTTD is 
caused by mutations in the glucosamine-6-sulfatase (G6S) gene (Robertson et al., 1988, 
Hum. Genet 79:175-178); mucopolysaccharidosis type IVA (Morquio syndrome) is caused 
by mutations m the GALNS gene (Tomatsu et at, 1995, Am. J. Hum. Genet 57:556-563; 
Tomatsu et al.,1995. Hum. Mutat 6:195-196); mucopolysaccharidosis type VI (Maroteaux- 

25 Lamysyndrome) is caused by mutations in the ARSB gene (Litjens et at, 1992, Hum. 
Mutat. 1:397-402; Isbrandt et al., 1996, Hum. Mutat 7:361-363); mucopolysaccharidosis 
type Vn (Sly syndrome) is caused by mutations in the beta-glucuromdase (GUSB) 
gene(Yamada et al., 1995, Hum. Molec. Genet 4:651-655); mutations m CLNl (PPTl) 
cause infantile neuronal ceroid lipofuscinosis (Das et al., 1998 J. Clm. luvest 102:361-370; 

30 Mitchison et al., 1998, Hum. Molec. Genet 7:291-297); late mfantile type ceroid 
lipofuscinosis is caused by mutations in the CLN2 gene (Sleat et al., 1997, Science 
277:1802-1805); juvenile neuronal ceroid lipofuscinosis (Batten disease) is caused 
bymutations in the CLN3 gene (Mole et al,, 1999, Hum. Mutat 14: 199-215); late 
infantileneuronal ceroid lipofuscinosis, Finnish variant, is caused by mutations in the CLN5 

35 gene(Savukoski et al., 1998, Nature Genet 19:286-288); late-mfantile form of neuronal 
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ceroid lipofuscinosis is caused by mutations in the CLN6 gene (Gao et al., 2002, Am. 
J.Hum. Genet. 70:324-335); Niemann-Pick disease is caused by mutations in the ASM gene 
(Takahashi et al., 1992, J. Biol. Chem. 267:12552-12558; types A and B) and theNPCl 
gene (Millat et al., 2001, Am. J. Hum. Genet. 68:1373-1385; type C); Kanzaki disease is 
5 caused by mutatiotis in the NAGA gene (Keulemans et et al., 1996, J. Med.Genet. 33 :458- 
464); Gaucher disease is caused by mutations in the GBA gene (Stone, et al., 1999, Europ. 
J. Hum. Genet. 7:505-509); Glycogen storage disease H is the prototypic lysosomal storage 
disease and is caused by mutations in the GAA gene(Becker et al., 1998, Am. J. Hum. 
Genet. 62:991-994); Krabbe disease is caused by mutations in the GALC gene (Sakai et al., 

10 1994, Biochem, Biophys. Res. Commun. 198:485-491); Tay-Sachs disease is caused by 
mutations in the HEXA gene (Akli et al., 1991, Genomics 1 1:124-134; Mules et al., 1992, 
Am. J. Hum. Genet. 50: 834-841 ;Triggs-Raine et al., 1991, Am. J. Hum. Genet. 49:1041- 
1054; Drucker et al., 1993, Hum. Mutat. 2:415-417; Shore et al., 1992, Hum. Mutat 1:486- 
490); mutations in the GM2Agene causes Tay-Sachs variant AB (Schepers et aL, 1996, Am. 

15 J. Hum. Genet. 59:1048-1056; Chen et al., 1999, Am. J. Hum. Genet. 65:77-87); mutations 
in the HEXB gene cause SandhoflF disease (Zhang et al., 1994, Hum Mol Genet 3:139-145); 
alphamannosidosis type n is caused by mutations in the MAN2B1 gene (Gotoda et al., 
1998, Am. J. Hum. Genet. 63:1015-1024; Autio et al., 1973, ActaPaediat. Scand. 62:555- 
565); metachromatic leukodystrophy is caused by mutations in the ARS A gene(Gieselmann 

20 et al., 1994, Hum. Mutat. 4:233-242). Any gene containing a premature translation codon 
implicated in lysosomal storage disease disorders including, but not limited to, the nonsense 
mutations and genes described in the references cited above, can be used in the present 
invention to identify compounds that mediate premature translation termination and/or 
nonsense-mediated mRNA decay. 

25 5.8.17 Tuberous Sclerosis 



characterized by the presence of hamartomata in multiple organ systems. The disease is 
caused bymutations in TSCl (van Slegtenhorst et aL, 1997 Science 277:805-808; Sato et 
aL, 2002, J. Hum. Genet. 47:20-28) and/or TSC2 (Vrtel et aL, 1996, J. Med. Genet. 33:47- 
30 51; Wilson et aL, 1996, Hum. Molec. Genet. 5:249-256; Au et aL, 1998, Am. J.Hum. 

Genet. 62:286-294; Verhoef aL, 1999, Europ. J. Pediat. 158:284-287;Carsmo et aL, 2000, 
Proc. Nat Acad. Sci. 97:6085-6090). Any gene containing a premature translation codon 
implicated in tuberous sclerosis including, but not limited to, the nonsense mutations 
described in the references cited above, can be used in the present invention to identify 



Tuberous sclerosis complex (TSC) is a dominantly inherited disease 
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compounds that mediate premature translation termination and/or nonsense-mediated 
mRNA decay. 

5.9 Secondary Biological Screens 

5.9.1 In Vitro Assays 
5 The compounds identified in the nonsense suppression assay (for 

conyenience referred to herein as a "lead" compound) can be tested for biological actiyity 
using host cells containing or engineered to contain the target KNA element coupled to a 
functional readout system. 

5.9.1,1 Reporter Gene Assays 

10 The lead compound can be tested in a host cell engineered to contain the 

SNA with a premature translation termination codon controlling the expression of a 
reporter gene. In this example, the lead compounds are assayed in the presence or absence 
of the RNA with the premature translation termination codon. Compounds that modulate 
premature translation termination and/or nonsense-mediated mRNA decay will result in 

15 increased expression of the full-length gene, i.e., past the premature termination codon. 
Alternatively, a phenotypic or physiological readout can be used to assess activity of the 
target RNA with the premature translation termination codon in the presence and absence of 
the lead compound. In another embodiment of the invention, the compounds identified in 
the nonsense suppression assay (for convenience referred to herein as a "lead" compound) 

20 can also be tested for biological activity using an in vitro transcribed RNA firom the gene 
with a premature translation termination codon and subsequent in vitro translation of that 
RNA in a cell-firee translation extract. The activity of the lead compound in the in vitro 
translation mixture can be determined by any method that measures increased expression of 
the full-length gene» i.e., past the premature termination codon. For example, expression of 

25 a functional protein firom the full-length gene {e.g. , a reporter gene) can be measured to 
determine the effect of the lead compound on premature translation termination and/or 
nonsense-mediated noRNA decay in an in vitro system. Both the in vitro and in 
vivo nonsense suppression assays described in Intemational Patent Publication No. WO 
01/44516 and Intemational Patent Application No. PCT/US03/19760, each of which is 

30 incorporated by reference in its entirety. 

5.9.1.1.1 Reporter Gene Constrncts. Transfected Cells and Cell-Free Extracts 

The invention provides for reporter genes to ascertain the ejBFects of a 
compound on premature translation termination and/or nonsense-mediated mRNA decay. 
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In general, the level of expression and/or activity of a reporter gene product is indicative of 
the effect of the compound on premature translation termination and/or nonsense-mediated 
mRNA decay. 

The invention provides for specific vectors comprising a reporter gene 
5 operahly linked to one or more regulatory elements and host cells transfected with the 
vectors. The invention also provides for the in vitro translation of a reporter gene flanked 
by one or more regulatory elements. A reporter gene may or may not contain a premature 
stop codon depending on the assay conducted. Techniques for practicing this specific 
aspect of this invention will employ, unless otherwise indicated, conventional techniques of 

10 molecular biology, microbiology, and recombinant DNA manipulation and production, 
which are routinely practiced by one of skill in the art. See, e.g.^ Sambrook, 1989, 
Molecular Cloning, A Laboratory Manual, Second Edition; DNA Cloning, Volumes I and n 
(Glover, Ed. 1985); Oligonucleotide Synthesis (Gait, Ed. 1984); Nucleic Acid 
Hybridization (Hames & Higgins, Eds. 1984); Transcription and Translation (Hames & 

15 Higgins, Eds. 1984); Animal Cell Culture (Freshney, Ed. 1986); hnmobilized Cells and 
Enzymes (IRL Press, 1986); Peibal, A Practical Guide to Molecular Cloning (1984); Gene 
Transfer Vectors for Mammahan Cells (Miller & Calos, Eds. 1987, Cold Spring Harbor 
Laboratory); Methods in Enzjmology, Volimies 154 and 155 (Wu & Grossman, and Wu, 
Eds., respectively), (Mayer & Walker, Eds., 198^^; Immunochemical Methods in Cell and 

20 Molecular Biology (Academic Press, London, Scopes, 1987), Expression of Proteins in 

Mammalian Cells Using Vaccinia Viral Vectors in Current Protocols in Molecular Biology, 
Volume 2 (Ausubel et al., Eds., 1991). 

5.9.1.1.1.1 Reporter Genes 

Any reporter gene well-known to one of skill in the art may be used in 
25 reporter gene constructs to ascertain the effect of a compound on premature translation 

termination. Reporter genes refi^ to a nucleotide sequence encoding a protein, polypeptide 
or peptide that is readily detectable either by its presence or activity. Reports genes may 
be obtained and the nucleotide sequence of the elements determined by any method well- 
known to one of skill in the art. The nucleotide sequence of a reporter gene can be 
30 obtained, e,g,, from the literature or a database such as GenBank. Alternatively, a 

polynucleotide encoding a reporter gene may be generated from nucleic acid from a suitable 
source. If a clone containing a nucleic acid encoding a particular reporter gene is not 
available, but the sequence of the reporter gene is known, a nucleic acid encoding the 
reporter gene may be chemically synthesized or obtained from a suitable source (e,g,, a 
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cDNA library, or a cDNA library generated from, or nucleic acid, preferably poly A+ RNA, 
isolated from, any tissue or cells expressing the reporter gene) by PCR amplification. Once 
the nucleotide sequence of a reporter gene is determined, the nucleotide sequence of the 
reporter gene may be manipulated using methods well-known in the art for the manipulation 
5 of nucleotide sequences, e,g., recombinant DNA techniques, site directed mutagenesis, 
PGR, etc. (see, for example, the techniques described in Sambrook et al., 1990, Molecular 
Cloning, A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY and Ausubel et al^ eds., 1998, Current Protocols in Molecular Biology, John 
Wiley 8c Sons, NY, which are both incorporated by reference herein in their entireties), to 

10 generate rqporter genes having a different amino acid sequence, for example to create 
amino acid substitutions, deletions, and/or insertions. 

In a specific embodiment, a reporter gene is any naturally-occurring gene 
with a premature stop codon. Genes with premature stop codons that are useful in the 
present invention include, but are not limited to, the genes described below. In an 

15 alternative embodiment, a reporter gene is any gene that is not known in nature to contain a 
prCTfiature stop codon. Examples of reporter genes include, but are not limited to, luciferase 
(e.g., firefly luciferase, renilla luciferase, and click beetle luciferase), green fluorescait 
protein ("GFF') {e,g,, green fluorescent protein, yellow fluorescent protein, red fluorescent 
protein, cyan fliiorescent protein, and blue fluorescent protein), beta-galactosidase ('"beta- 

20 gal**)? beta-glucoronidase, beta-lactamase, chloramphenicol acetyltransferase ("CAT*), and 
alkaline phosphatase ("AP"). Alternatively, a reporter gene can also be a protein tag, such 
as, but not limited to, myc. His, FLAG, or GST, so that nonsense suppression will produce 
the peptide and the protein can be monitored by an ELISA, a westem blot, or any other 
immunoassay to detect the protein tag. Such methods are well known to one of skill in the 

25 art. In a preferred embodiment, the reporter gene is easily assayed and has an activity 
which is not normally found in the gene of interest. Table 27 below lists various reporter 
genes and the properties of the products of the reporter genes that can be assayed. In a 
preferred embodiment, a reporter gene utilized in the reporter constructs is easily assayed 
and has an activity which is not normally foimd in the cell or organism of interest 

30 

TABLE 27: Rqporter Genes and the Properties of the Reporter Gene Products 



Reporter Gene 


Protein Activity & Measurement 


CAT (chloramphenicol acetyltransferase) 


Transfers radioactive acetyl groups to 
chloramphenicol or detection by thin layer 
chromatography and autoradiography 
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Reporter G^e 


Frotein Activity & Measurement 


GAL (beta-galactosidase) 


W 41 11 1al« m ^ ^ 

Hydrolyzes colorless galactosides to yield 
colored products. 


GUS (beta-glucuronidase) 


Hydrolyzes colorless glucuronides to yield 
colored products. 


LUC (luciferase) 


Oxidizes luciferin, emitting photons 


GFP (green fluorescent protein) 


Fluorescent protein without substrate 


SEAP (secreted alkaline phosphatase) 


Luminescence reaction with suitable substrates 
or with substrates that generate chromophores 


HRP (horseradish peroxidase) 


In the presence of hydrogen oxide, oxidation of 
3,3*,5,5'-tetramethylbenzidine to form a colored 
complex 


AP (alkaline phosphatase) 


Lumiaescence reaction with suitable substrates 
or with substrates that generate chromophores 



Described hereinbelow in further detailed are specific reporter genes and 
characteristics of those reporter genes. 

Luciferase 

Luciferases are enzymes that emit Ugiht in the presence of oxygen and a 
5 substrate (luciferin) and which have been used for real-time, low-Ught imaging of gene 
expression in cell cultures, individual cells, whole organisms, and transgenic organisms 
(reviewed by Greer & Szalay, 2002, Luminescence 17(l):43-74). 

As used herein, the term 'luciferase" is intended to embrace all luciferases, 
or recombinant enzymes derived from luciferases which have luciferase activity. The 

10 luciferase genes from fireflies have been well characterized, for example, from the Photinus 
saidLuciola species (see, e.g., Intemational Patent Publication No. WO 95/25798 for 
Photinus pyralis, European Patent Application No, EP 0 524 448 for Luciola cruciata and 
Luciola lateralis^ andDevine et al., 1993, Biochim. Biophys. Acta 1173(2): 121-132 for 
Luciola mingrelica). Other eucaryotic luciferase genes include, but are not limited to, the 

15 click beetle {Photinus plagiophthalamus, see, eg:. Wood et al., 1989, Science 244:700- 
702), the sea panzy {Renilla reniformis, see, e.g., Lorenz et al., 1991, Proc Natl Acad Sci U 
S A 88(10):4438-4442), and the glow worm {Lampyris noctiluca, see e,g., Sula-Newby et 
al., 1996, Biochem J. 313:761-767). The cMck beetie is miusual in that diflEerent members 
of the species emit bioluminescence of different colors, which emit light at 546 nm (green), 

20 560 nm (yellow-green), 578 nm (yellow) and 593 run (orange) (see, e.g, U.S. Patent Nos. 

6,475,719; 6,342,379; and 6,217,847, the disclosures of which are incorporated by reference 

in their entireties). Bacterial luciferin-luciferase systems include, but are not limited to, the 
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bacterial lux genes of terrestrial Photorhabdus luminescens (see, eg., Maniikhov et al., 
2000, Genetika 36(3):322-30) and marine bacteria Vibrio fischeri and Vibrio harveyi (see, 

Miyamoto et al., 1988, J Biol Chem. 263(26):13393-9, and Cohn et al., 1983, Proc Natl 
Acad Sci USA., 80(1): 120-3, respectively). The luciferases encompassed by the present 
5 invention also includes the mutant luciferases described in U.S. Patent No. 6,265,177 to 
Squirrell et al., which is hereby incorporated by reference in its entirety. 

In a specific embodiment, the ludferase is a firefly luciferase, a renilla 
luciferase, or a cUck beetle luciferase, as described in any one of the references listed supra, 
the disclosures of which are incorporated by reference in their entireties. 

10 Green Flnorescent Protein 

Green fluorescent protein ("GFP") is a 238 amino acid protein with amino 
acid residues 65 to 67 involved in the formation of the chromophore which does not require 
additional substrates or cofactors to fluoresce (see, e.g,, Prasher et al., 1992, Gene 111:229- 
233; Yang et al., 1996, Nature Biotechnol, 14:1252-1256; and Cody et al., 1993, 

15 Biochemistry 32:1212-1218). 

As used herein, the term "green fluorescent protein" or "GFP" is intended to 
embrace all GFPs (including the various forms of GEPs which exhibit colors other than 
green), or recombinant enzymes derived from GFPs which have GFP activity. In a 
preferred embodiment, GFP includes green fluorescent protein, yellow fluorescent protein, 

20 red fluorescent protein, cyan fluorescent protein, and blue fluorescent protein. The native 
gene for GFP was cloned from the bioluminescent jellyfish y^eg^wor^a victoria (see, e.g., 
Morin et al., 1972, J. Cell Physiol. 77:313-318). Wild type GFP has a major excitation peak 
at 395 nm and a minor excitation peak at 470 nm. The absorption peak at 470 nm allows 
the monitoring of GFP levels using standard fluorescein isothiocyanate (FITC) filter sets. 

25 Mutants of the GFP gene have been found useful to enhance expression and to modify 

excitation and fluorescence. For example, mutant GFPs with alanine, glycine, isoleucine, or 
threonine substituted for serine at position 65 result in mutant GFPs with shifts in excitation 
maxima and greater fluorescence than wild type protein when excited at 488 nm (see, eg., 
Heim et al., 1995, Nature 373:663-664; U.S. Patent No. 5,625,048; Delagrave et al., 1995, 

30 Biotechnology 13:151-154; Cormack et al., 1996, Gene 173:33-38; and Cramer et al., 1996, 
Nature Biotechnol. 14:315-319). The abiUty to excite GFP at 488 nm permits the use of 
GFP with standard fluorescence activated cell sorting ('TAGS") equipment In another 
embodiment, GFPs are isolated from organisms other than the jellyfish, such as, but not 
limited to, the sea pansy, Renilla reriformis. 
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Techniques for labeling cells with GFP in general are described in U.S. 
Patent Nos. 5,491,084 and 5,804,387, which are incorporated by reference in their 
entireties; Chalfie et al, 1994, Science 263:802-805; Heim et al., 1994, Proc. Natl. Acad. 
Sci. USA 91:12501-12504; Morise et al, 1974, Biochemistry 13:2656-2662; Ward et al., 
5 1980, Photochem. Photobiol. 31:611-615; Rizzuto et al., 1995, Curt. Biology 5:635-642; 
andKaether&Gerdes, 1995, FEES Lett 369:267-271. The expression of GFPs in £. coli 
and C elegans are described in U.S. Patent No. 6,251,384 to Tan et al., which is 
incorporated by reference in its entirety. The expression of GFP in plant cells is discussed 
in Hu & Cheng, 1995, FEES Lett 369:331-33, and GFP expression in Drosophila is 
10 described in Davis et al., 1995, Dev. Biology 170:726-729. 

Beta Galactosidase 

Beta galactosidase C1)eta-gal") is an enzyme that catalyzes the hydrolysis of 
beta-galactosides, including lactose, and the galactoside analogs o-nitrophenyl-beta-D- 
galactopyranoside ("ONPG") and chlorophenol red-beta-D-galactopyranoside ("CPRG") 

15 (see, e.g,, Nielsen et al., 1983 Proc Natl Acad Sci USA 80(1 7):5 198-5202; Eustice et al., 
1991, Biotechniques 11:739-742; and Henderson et al., 1986, Clin. Chem. 32:1637-1641). 
The beta-gal gene functions well as a reporter gene because the protein product is extremely 
stable, resistant to proteolytic degradation in cellular lysates, and easily assayed. When 
ONPG is used as the substrate, beta-gal activity can be quantitated with a 

20 spectrophotometer or microplate reader. 

As used herein, the term ^^beta galactosidase" or ''beta-gal" is intended to 
embrace all beta-gals, including lacZ gene products, or recombinant enzymes derived from 
beta-gals which have beta-gal activity. The beta-gal gene functions well as a reporter gene 
because flie protein product is extremely stable, resistant to proteolytic degradation in 

25 cellular lysates, and easily assayed. In an embodiment where ONPG is the substrate, beta- 
gal activity can be quantitated with a spectrophotometer or microplate reader to determine 
tiie amount of ONPG converted at 420 nm. In an embodiment when CPRG is the substrate, 
beta-gal activity can be quantitated with a spectrophotometer or microplate reader to 
determine the amount of CPRG converted at 570 to 595 nm. In yet another embodiment, 

30 tiie beta-gal activity can be visually ascertained by plating bacterial cells transformed with a 
beta-gal construct onto plates containing Xgal and IPTG. Bacterial colonies that are dark 
blue indicate the presence of high beta-gal activity and colonies fliat are varying shades of 
blue indicate varying levels of beta-gal activity. 
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Beta-GlucuroBidase 

Beta-glucuronidase ("GUS") catalyzes the hydrolysis of a very wide variety 
of beta-glucuronides, aad, with much lower efficiency, hydrolyzes some beta- 
galacturonides. GUS is very stable, will tolerate many detergents and widely varying ionic 
conditions, has no cofactors, nor any ionic requiremmts, can be assayed at any 
physiological pH, with an optiminn between 5.0 and 7,8, and is reasonably resistant to 
thermal inactivation (see, e,g., U.S. Patent No. 5,268,463, which is incoiporated by 
reference in its entirety). 

In one embodiment, the GUS is derived from the Esherichia coli beta- 
glucuronidase gene. In alternate embodiments of the invention, the beta-glucuronidase 
encoding nucleic acid is homologous to the E. coli beta-glucuronidase gene and/or may be 
derived from another organism or species. 

GUS activity can be assayed either by fluorescence or spectrometry, or any 
other method described in U.S. Patent No. 5,268,463, the disclosure of which is 
incorporated by reference in its entirety. For a fluorescent assay, 4- 
trifluoromethylumbelUferyl beta-D-glucuronide is a very sensitive substrate for GUS. The 
fluorescence maximum is close to 500 nm~bluish green, where very few plant compoimds 
fluoresce or absorb. 4-trifluoromethylumbelliferyl beta-D-glucuronide also fluoresces 
much more strongly near neutral pH, allowing continuous assays to be performed more 
readily than with MUG. 4-trifluoromethylumbelliferyl beta-D-glucuronide can be used as a 
fluorescent indicator in vivo. The spectrophotometric assay is very straightforward and 
moderately sensitive (Jefferson et al., 1986, Proc. Natl. Acad. Sci. USA 86:8447-8451). A 
preferred substrate for spectrophotometric measurement is p-nitrophenyl beta-D- 
glucuronide, which when cleaved by GUS releases the chromophore p-nitrophenol. At a 
pH greater than its pKa (around 7.15) the ionized chromophore absorbs light at 400-420 nm, 
giving a yellow color. 

Beta-Lactamases 

Beta-lactamases are nearly optimal enzjmies in respect to tiieir almost 
diflEiision-controlled catalysis of beta-lactam hydrolysis, making them suited to the task of 
an intracellular reporter enzyme (see, e^., Christensen et al., 1990, Biochem. J. 266: 853- 
861). They cleave the beta-lactam ring of beta-lactam antibiotics, such as penicillins and 
cephalosporins, generating new charged moieties in the process (see, O'Callaghan et 
al., 1968, Antimicrob. Agents. Chemother. 8: 57-63 and Stratton, 1988, J. Antimicrob. 
Chemother. 22, Suppl. A: 23-35). A large number of beta-lactamases have been isolated 
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and characterized, all of which would be suitable for use in accordance with the present 
invention (see, e.g., Richmond & Sykes, 1978, Adv.Microb.Physiol. 9:31-88 and Ambler, 
1980, Phil. Trans. R. Soc. Lend. [Ser.B.] 289: 321-331, the disclosures of which are 
incorporated by reference in their entireties). 
5 The coding region of an exemplary beta-lactamase employed has been 

described in U.S. Patent No. 6,472,205, Kadonaga et al., 1984, J.Biol.Chem. 259: 2149- 
2154, and SutcUfife, 1978, Proc. Natl. Acad. Sci. USA 75: 3737-3741, the disclosures of 
which re incorporated by reference m theu- entketies. As would be readily apparent to those 
skilled in the field, this and other comparable sequences for peptides having beta-lactamase 

10 activity would be equally suitable for use in accordance with the present invention. The 
combination of a fluorogenic substrate described in U.S. Patent Nos. 6,472,205, 5,955,604, 
and 5,741,657, the disclosures of which are incorporated by reference in their entireties, and 
a suitable beta-lactamase can be employed in a wide variety of different assay systems, such 
as are described in U.S. Patent No. 4,740,459, which is hereby incorporated by reference in 

15 its entirety. 

Chloramphenicol Acetvltransferase 

Chloramphenicol acetyl transferase ("CAT") is commonly used as a reporter 
gene in mammalian cell systems because mammalian cells do not have detectable levels of 
CAT activity. The assay for CAT involves incubating cellular extracts with radiolabeled 
20 chloramphenicol and appropriate co-factors, separating the starting materials from the 
product by, for example, thin layer chromatography C'TLC"), followed by scintillation 
counting (see, e.^., U.S. Patent No. 5,726,041, which is hereby incorporated by reference in 
its entirety). 

As used herem, the temi "chloramphenicol acetyltransferase" or "CAT" is 
25 intended to embrace all CATs, or recombinant enzymes derived from CAT which have 
CAT activity. While it is preferable that a reporter system which does not require cell 
processing, radioisotopes, and chromatographic separations would be more amenable to 
high through-put screening, CAT as a reporter gene may be preferable in situations when 
stability of the reporter gene is important. For example, the CAT r^orter protein has an in 
30 vivo half life of about 50 hours, which is advantageous when an accumulative versus a 
dynamic change type of result is desired. 

Secreted Alkaline Phosphatase 
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The secreted alkaline phosphatase ("SEAP") enzyme is a truncated form of 
alkaline phosphatase, in which the cleavage of the transmembrane domain of the protein 
allows it to be secreted from the cells into the smroimding media. In a preferred 
embodiment, the alkaline phosphatase is isolated from human placenta. 
5 As used herein, the term "secreted alkaline phosphatase" or "SEAP" is 

intended to embrace all SEAP or recombinant enzymes derived &om SEAP which have 
alkaline phosphatase activity. SEAP activity can be detected by a variety of methods 
including, but not limited to, measurement of catalysis of a fluorescent substrate, 
immunoprecipitation, HPLC, and radiometric detection. The luminescent method is 

10 preferred due to its increased sensitivity over calorimetric detection methods. The 

advantages of using SEAP is that a cell lysis step is not required since the SEAP protem is 
secreted out of the cell, which facilitates the automation of sampling and assay procedures. 
A cell-based assay using SEAP for use in cell-based assessment of inhibitors of the 
Hepatitis C virus protease is described in U.S. Patent No. 6,280,940 to Potts et al. which is 

1 5 hereby incorporated by reference in its entirety. 

5.9.1.1.1.2 Stop Codons 
The present invention provides for methods for screening and identifying 
compounds that modulate premature translation termination and/or nonsense-mediated 

20 mRNA decay. A reporter gene may be engineered to contain a premature stop codon or 
may naturally contain a premature stop codon. Alternatively, a protein, polypeptide or 
peptide that regulates (directly or indirectly) the expression of a reporter gene may be 
engineered to contain or may naturally contain a premature stop codon. The premature stop 
codon may any one of the stop codons known in the art including UAG, UAA and UGA. 

25 In a specific embodiment, a reporter gene contains or is engineered to 

contain the premature stop codon UAG. In another embodiment, a reporter gene contains 
or is engineered to contain the premature stop codon UGA. 

In a particular embodiment, a reporter gene contains or is engineered to 
contain two or more stop codons. In accordance with this embodiment, the stop codons are 

30 preferably at least 10 nucleotides, at least 15 nucleotides, at least 20 nucleotides, at least 25 
nucleotides, at least 30 nucleotides, at least 35 nucleotides, at least 40 nucleotides, at least 
45 nucleotides, at least 50 nucleotides, at least 75 nucleotides or at least 100 nucleotides 
apart from each other. Further, in accordance with this embodiment, at least one of the stop 
codons is preferably UAG or UGA. 
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In a specific embodiment, a reporter gene contains or is engineered to 
contain a premature stop codon at least 15 nucleotides, preferably at least 20 nucleotides, at 
least 25 nucleotides, at least 30 nucleotides, at least 35 nucleotides, at least 40 nucleotides, 
at least 45 nucleotides, at least 50 nucleotides or at least 75 nucleotides jfrom the start codon 
5 in the coding sequence. In another embodimmt, a reporter gene contains or is engineered to 
contain a premature stop codon at least 15 nucleotides, preferably at least 25 nucleotides, at 
least 50 nucleotides, at least 75 nucleotides, at least 100 nucleotides, at least 125 
nucleotides, at least 150, at least 175 nucleotides or at least 200 nucleotides from the native 
stop codon in the coding sequence of the fiill-length reporter gene product or protein, 

10 polypeptide or peptide. In another embodiment, a reporter gene contains or is engineered to 
contain a premature stop codon at least 15 nucleotides (preferably at least 20 nucleotides, at 
least 25 nucleotides, at least 30 nucleotides, at least 35 nucleotides, at least 40 nucleotides, 
at least 45 nucleotides, at least 50 nucleotides or at least 75 nucleotides) firom the start 
codon in the coding sequence and at least 15 nucleotides (preferably at least 25 nucleotides, 

15 at least 50 nucleotides, at least 75 nucleotides, at least 100 nucleotides, at least 125 

nucleotides, at least 150, at least 175 nucleotides or at least 200 nucleotides) from the native 
stop codon in the coding sequence of the full-length reporter gene product or protein, 
polypeptide or peptide. In accordance with these embodiments, the premature stop codon is 
preferably UAG or UGA. 

20 The premature translation stop codon can be produced by in vitro mutagenesis 

techniques such as, but not limited to, polymerase chain reaction CTCR")> linker insertion, 
oligonucleotide-mediated mutagenesis, and random chemical mutagenesis. 

5,9.1,1.1.3 Vectors 

The nucleotide sequence encoding for a protein, polypeptide or peptide (eg., 
25 a reporter gene), can be inserted into an appropriate e3q>ression vector, i,e,, a vector which 
contains the necessary elements for the transcription and translation of the inserted protein- 
coding sequence. The necessary transcriptional and translational elements can also be 
supplied by the protein, polypeptide or peptide. The regulatory regions and enhancer 
elements can be of a variety of origins, both natural and synthetic. In a specific 
30 embodiment, a reporter gene is operably linked to regulatory element that is responsive to a 
regulatory protein whose e?q>ression is dependent upon the suppression of a premature stop 
codon. 

A variety of host-vector systems may be utilized to express a protein, 
polypeptide or peptide. These include, but are not limited to, mammalian cell systems 
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infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected with 
virus (eg., baculovirus); microorganisms such as yeast containing yeast vectors, or bacteria 
transformed with bacteriophage, DNA, plasmid DNA, or cosmid DNA; and stable cell lines 
generated by transformation using a selectable marker. The expression elements of vectors 
5 vary in their strengths and specificities. Dependmg on the host-vector system utilized, any 
one of a number of suitable transcription and translation elements may be used. 

Any of the methods previously described for the insertion of DNA firagments 
into a vector may be used to construct expression vectors containing a chimeric nucleic acid 
consisting of 2q)propriate transcriptional/translational control signals and the protein coding 

1 0 sequences. These methods may include in vitro recombinant DNA and synthetic techniques 
and in vivo recombinants (genetic recombination). Expression of a iBrst nucleic acid 
sequence encoding a protein, polypeptide or peptide, such as a reporter gene, may be 
regulated by a second nucleic acid sequence so that the first nucleic acid sequence is 
expressed in a host transformed with the second nucleic acid sequence. For example, 

15 expression of a nucleic acid sequence encoding a protein, polypeptide or peptide, such as a 
reporter gene, may be controlled by any promoter/enhancer element known in the art, such 
as a constitutive promoter, a tissue-specific promoter, or an inducible promoter. Specific 
examples of promoters which may be used to control gene expression include, but are not 
limited to, the SV40 early promoter region (Bemoist & Chambon, 1981, Nature 290:304- 

20 310), the promoter contained in the 3' long terminal repeat of Rous sarcoma virus 

(Y amamoto et al., 1980, Cell 22:787-797), the herpes thymidine kinase promoter (Wagner 
et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory sequences of the 
metallothionein gene (Brinster et al., 1982, Nature 296:39-42); prokaryotic expression 
vectors such as the j8-lactamase promoter (Villa-Kamaroflfet al., 1978, Proc. Natl. Acad. 

25 Sci. U.S.A. 75:3727-3731), or the tac promoter (DeBoer et al., 1983, Proc. Natl. Acad. Sci. 
U.S.A. 80:21-25); see also *TJsefiil proteins from recombinant bacteria" in Scientific 
American, 1980, 242:74-94; plant expression vectors comprising the nopaline synthetase 
promoter region (Herrera-Estrella et al.. Nature 303:209-213) or the cauliflower mosaic 
virus 35S RNA promoter (Gardner, et al., 1981, Nucl. Acids Res. 9:2871), and the promoter 

30 of the photosynthetic enzyme ribulose biphosphate carboxylase (Herrera-Estrella et al., 
1984, Nature 3 10: 1 15-120); promoter elements from yeast or other fimgi such as the Gal 4 
promoter, the ADC (alcohol dehydrogenase) promoter, PGK (phosphoglycerol kinase) 
promoter, alkaline phosphatase promoter, and the following animal transcriptional control 
regions, which exhibit tissue specificity and have been utilized in transgenic animals: 
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elastase I gene control region which is active in pancreatic acinar cells (Swift et al., 1984, 
Cell 38:639-646; Omitz et al., 1986, Cold Spring Harbor Symp. Quant. Biol. 50:399-409; 
MacDonald, 1987, Hepatology 7:425-5 15); insulin gene control region which is active in 
pancreatic beta cells (Hanahan, 1985, Nature 315:1 15-122), immunoglobulin gene control 
5 region which is active in lymphoid cells (Grosschedl et al., 1984, Cell 38:647-658; Adames 
et al., 1985, Nature 318:533-538; Alexander et al., 1987, Mol. CeU. Biol. 7:1436-1444), 
mouse manamary tumor virus control region which is active in testicular, breast, lymphoid 
and mast cells (Leder et al., 1986, Cell 45:485-495), albumin gene control region which is 
active in liver (Pinkert et al., 1987, Genes and Devel. 1:268-276), alpha-fetoprotein gene 

10 control region which is active in liver (Krumlauf et al., 1985, Mol. Cell. Biol. 5:1639-1648; 
Hanmier et al., 1987, Science 235:53-58; alpha 1-antitrypsin gene control region which is 
active in the liver (Kelsey et al., 1987, Genes and Devel. 1:161-171), beta-globin gene 
control region which is active in myeloid cells (Mogram et al., 1985, Nature 315:338-340; 
KolUas et al., 1986, CeU 46:89-94; myelin basic protein gene control region which is active 

15 in oligodendrocyte cells in the brain (Readhead et al., 1987, Cell 48:703-712); myosin light 
chain-2 gene control region which is active in skeletal muscle (Sard, 1985, Nature 314:283- 
286), and gonadotropic releasing hormone gene control region which is active in the 
hypothalamus (Mason et al., 1986, Science 234:1372-1378). 

In a specific embodiment, a vector is used that comprises a promoter 

20 operably linked to a reporter gene, one or more origins of replication, and, optionally, one 
or more selectable markers {e.g., an antibiotic resistance gene). In a preferred embodiment, 
the vectors are CMV vectors, T7 vectors, lac vectors, pCEP4 vectors, 5.0/F vectors, or 
vectors with a tetracycline-regulated promoter (e.g., pcDNA™5/FRT/TO from Invitrogen). 
Some vectors may be obtained commercially. Non-limiting examples of useful vectors are 

25 described in Appendix 5 of Current Protocols in Molecular Biology, 1988, ed. Ausubel et 
al., Greene Publish, Assoc. & Wiley Interscience, which is incorporated herein by 
reference; and the catalogs of commercial suppUers such as Clontech Laboratories, 
Stratagene Inc., and Invitrogen, Inc. 

Expression vectors containing a construct of the present invention can be 

30 identified by the following general approaches: (a) nucleic acid hybridization, (b) presence 
or absence of "marker^' nucleic acid functions, (c) expression of mserted sequences, and (d) 
sequencing. Jn the first approach, the presence of a particular nucleic acid sequence 
inserted in an expression vector can be detected by nucleic acid hybridization using probes 
conq)rising sequences that are homologous to the inserted nucleic acid sequence. In the 
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second approach, the recombmant vector/host system can be identified and selected based 
upon the presence or absence of certain "marker^' nucleic acid functions (e.g., thymidine 
kinase activity, resistance to antibiotics, transformation phenotype, occlusion body 
formation in baculovirus, eta) caused by the insertion of the nucleic acid sequence of 
5 interest m the vector. For example, if the nucleic acid sequence of interest is inserted within 
. tiie marker nucleic acid sequence of the vector, recombinants containing the insert can be 
identified by the absence of tiie marker nucleic acid function. In the third q)proach, 
recombinant expression vectors can be identified by assaying the product expressed by the 
recombinant Such assays can be based, for example, on the physical or functional 
1 0 properties of the particular nucleic acid sequence. 

In a preferred embodiment, nucleic acid sequences encoding proteins, 
polypeptides or peptides are cloned into stable cell line expression vectors. In a preferred 
embodiment, the stable cell line expression vector contains a site specific genomic 
integration site. In another preferred embodiment, the reporter gene construct is cloned into 
IS an episomal mammalian expression vector. 

5.9.1.1.1.4 Transfection 

Once a vector encoding the appropriate gene has been synthesized, a host 
cell is transformed or transfected with the vector of interest. The use of stable 
transformants is preferred. In a preferred embodiment, the host cell is a mammalian cell. In 

20 a more preferred embodiment, the host cell is a human cell. In another embodiment, the 
host cells are primary cells isolated fi"om a tissue or other biological sample of interest. 
Host cells that can be used in the methods of the present invention include, but are not 
limited to, hybridomas, pre-B cells, 293 cells, 293T cells, HeLa cells, HepG2 cells, K562 
cells, 3T3 cells. In another preferred embodiment, the host cells are derived Jfrom tissue 

25 specific to the nucleic acid sequence encoding a protein, polypeptide or peptide. In another 
preferred embodiment, the host cells are immortalized cell lines derived firom a source, 
a tissue. Other host cells that can be used in the present invention include, but are not 
limited to, bactmal cells, yeast cells, virally-infected cells, or plant cells. 

Preferred manamalian host cells include but are not limited to those derived 

30 fi-om humans, monkeys and rodents, (see, for example, Kriegler M. in "Gene Transfer and 
Expression: A Laboratory Manual", New York, Freeman & Co. 1990), such as monkey 
kidney cell line transformed by SV40 (COS-7, ATCC Accession No. CRL 1651); human 
embryonic kidney cell lines (293, 293-EBNA, or 293 cells subcloned for growth in 
suspension culture, Graham et al., J. Gen. Virol., 36:59, 1977; baby hamster kidney cells 
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(BHK, ATCC Accession No. CCL 10); Chinese hamster ovary-cells-DHFR (CHO, Urlaub 
and Chasin. Proc. Natl. Acad. ScL 77; 4216, 1980); mouse sertoU cells (Mather, Biol. 
Reprod. 23:243-251, 1980); mouse fibroblast cells (N1H-3T3), monkey kidney cells (CVI 
ATCC Accession No. CCL 70); afidcan green monkey kidney cells (VERO-76, ATCC 
5 Accession No. CKL-1587); human cervical carcinoma cells (HELA, ATCC Accession No. 
CCL 2); canine kidney cells (MDCK, ATCC Accession No. CCL 34); buffalo rat liver cells 
(BRL 3A, ATCC Accession No. CRL 1442); human lung cells (W138, ATCC Accession 
No. CCL 75); human liver cells (Hep G2, HB 8065); and mouse mammary tumor cells 
(MMT 060562, ATCC Accession No. CCL51). 

10 Other usefiil eukaryotic host-vector system may include yeast and insect 

systems. Li yeast, a number of vectors containing constitutive or inducible promoters may 
be used with Saccharomyces cerevisiae (baker's yeast), Schizosaccharomyces pombe 
(fission yeast), Pichia pastoris, and Hansenula polymorpha (methylotropic yeasts). For a 
review see. Current Protocols in Molecular Biology, Vol. 2, 1988, Ed. Ausubel et al., 

15 Greene Publish. Assoc. & Wiley Interscience, Ch. 13; Grant et al., 1987, Expression and 
Secretion Vectors for Yeast, in Methods in Enzymology, Eds. Wu & Grossman, 1987, 
Acad. Press, N.Y., Vol. 153, pp. 516-544; Glover, 1986, DNA Cloning, Vol. H, IRL Press, 
Wash., D.C., Ch. 3; and Bitter,. 1987, Heterologous Gene Expression in Yeast, Methods in 
Enzymology, Eds. Berger & Kimmel, Acad. Press, N.Y., Vol. 152, pp. 673-684; and The 

20 Molecular Biology of the Yeast Saccharomyces, 1 982, Eds. Strathem et al., Cold Spring 
Harbor Press, Vols. I and n. 



cells can be used. Transformation may be by any known method for introducing 
polynucleotides into a host cell, including, for example packaging the polynucleotide in a 

25 virus and transducing a host cell with the vims, and by direct uptake of the polynucleotide. 
The transformation procedure used depends upon the host to be transformed. Mammalian 
transformations {Le., transfections) by direct uptake maybe conducted using the calcium 
phosphate precipitation method of Graham & Van der Eb, 1978, Virol. 52:546, or the 
various known modifications thereof. Other methods for introducing recombinant 

30 polynucleotides into cells, particularly into mammalian cells, include dextran-mediated 
transfection, calciiun phosphate mediated transfection, polybrene mediated transfection, 
protoplast fusion, electroporation, encapsulation of the polynucleotide(s) in liposomes, and 
direct microinjection of the polynucleotides into nuclei. Such methods are well-known to 
one of skill in the art. 



Standard methods of introducing a nucleic acid sequence of interest into host 
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In a preferred embodiment, stable cell lines containing the constructs of 
interest are generated for high throughput screening. Such stable cells lines may be 
generated by introduciag a construct comprising a selectable marker, allowing the cells to 
grow for 1-2 days in an enriched medium, and then growing the cells on a selective 
S medium. The selectable marker in the recombinant plasmid confers resistance to the 
selection and allows cells to stably integrate the plasmid into their chromosomes and grow 
to form foci which in turn can be cloned and expanded into cell lines. 

A number of selection systems may be used, including but not limited to the 
herpes simplex virus thymidine kinase (Wigler, et al., 1977, Cell 11:223), hypoxanthine- 

10 guanine phosphoribosyltransferase (Szybalska & Szybalski, 1962, Proc. Natl. Acad. Sci. 
USA 48:2026), and adenine phosphoribosyltransferase (Lowy, et aL, 1980, Cell 22:817) 
genes can be employed in tk-, hgprt- or aprt- cells, respectively. Also, anti-metabolite 
resistance can be used as the basis of selection for dhfr, which confers resistance to 
methotrexate (Wigler, et al., 1980, Natl. Acad. Sci, USA 77:3567; O'Hare, et al., 1981, 

15 Proc, Natl. Acad. Sci. USA 78:1527); gpt, which confers resistance to mycophenolic acid 
(Mulligan & Berg, 1981, Proc. Natl. Acad. Sci. USA 78:2072); neo, which confers 
resistance to the aminoglycoside G-418 (Colberre-Garq)in, et al., 1981, J. MoL Biol. 
150:1); and hygro, which confers resistance to hygromycin (Santerre, et al., 1984, Gene 
30:147) gmes. 

20 5.9.1.1.1.5 Cell-Free Extracts 

The invention provides for the traaslation of a nucleic acid sequence 
encoding a protein, polypeptide or peptide (with or without a premature translation stop 
codon) in a cell-free system. Techniques for practicing the specific aspect of this invration 
will employ, unless otherwise indicated, conventional techniques of molecular biology, 

25 microbiology, and recombinant DNA manipulation and production, which are routinely 
practiced by one of skill in the art. See, e.g., Sambrook, 1989, Molecular Cloning, A 
Laboratory Manual, Second Edition; DNA Cloning, Volumes I and II (Glover, Ed. 1985); . 
and Transcription and Translation (Hames & Higgias, Eds. 1984). 

Any technique well-known to one of skill in the art may be used to generate 

30 cell-free extracts for translation. For example, the cell-free extracts can be generated by 
centrifiiging cells and clarifying the supernatant. In one embodiment, the cells are 
incubated on ice during the preparation of the cell-free extract In another embodiment, the 
cells are incubated on ice at least 12 hours, at least 24 hours, at least two days, at least five 
days, at least one week, at least longer than one week. In a more specific embodiment, the 
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cells are incubated on ice at least long enough so as to improve the translation activity of the 
cell extract in comparison to cell extracts that are not incubated on ice. In yet another 
embodiment, the cells are incubated at a temperature between about 0 ""C and 10 ^C. In a 
preferred embodiment, the cells are incubated at about 4 '^C. 
5 In another preferred embodiment, flie cells are centrifiiged at a low speed to 

isolate the cell-free extract for in vitro translation reactions. In a preferred embodiment, ttie 
cell extract is the supernatant from cells that axe centrifizged at about 2 x g to 20,000 x g. In 
a more preferred embodiment, the cell extract is the supernatant from cells that are 
centrifiiged at about 5 x g to 15,000 x g. In an even more prefOTed embodiment, the cell 

10 extract is the supernatant from cells that are centrifiiged at about 10,000 x g. Alternatively, 
in a preferred embodiment, the cell-free extract is about the SI to S50 extract, hi a more 
preferred embodiment, the cell extract is about the S5 to S25 extract. In an even more 
preferred embodiment, the cell extract is about the SIO extract. 

The cell-free translation extract may be isolated from cells of any species 

15 origin. In another embodiment, tihe cell-free translation extract is isolated from yeast, 
cultured mouse or rat cells, Chinese hamster ovary (CHO) cells, Xenopus oocytes, 
reticulocytes, wheat germ, or rye embryo (see, e.g., Krieg & Melton, 1984, Nature 308:203 
and Dignam ef aL, 1990 Methods EnzymoL 182:194-203). Alternatively, the cell-free 
translation extract, e.g., rabbit reticulocyte lysates and wheat germ extract, can be purchased 

20 from, Promega, (Madison, WT). In another embodiment, the cell-firee translation 

extract is prepared as described in Intemational Patent Publication No. WO 01/445 16 and 
U.S. Patent No. 4,668,625 to Roberts, the disclosures of which are incorporated by 
reference in their entireties. In a preferred embodiment, the cell-free extract is an extract 
isolated fix>m human cells. In a more preferred embodiment, the human cells are HeLa 

25 cells. It is preferred that the endogenous expression of the genes with the premature 

translation codons is minimal, and preferably absent, in the cells from which the cell-free 
translation extract is prepared. 

Systems for the in vitro transcription of RNAs with the gene of interest 
cloned in an expression vectors using promoters such as, but not limited to, Sp6, T3, or T7 

30 promoters (see, expression vectors from Invitrogen, Carlesbad, CA; Promega, 

Madison, WI; and Stratagene, La Jolla, CA), and the subsequent transcription of the gene 
with the appropriate polymerase are well-known to one of skill in the art (see, e.g., 
Contreras et al, 1982, Nucl. Acids. Res. 10:6353). In another embodiment, the gene 
encoding the prraiature stop codon can be PCR-amplified with the appropriate primers, 
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with the sequence of a promoter, such as but not limited to, Sp6, T3, or T7 promoters, 
incorporated into the upstream primer, so that the resulting amplified PCR product can be in 
vitro transcribed with the appropriate polymerase. 

Altematively, a coupled transcription-translation system can be used for the 
5 expression of a gene encoding a premature stop codon in a cell free extract, such as the 
TnT® Coupled Transcription/Translation System (Promega, Madison, Wl) or the system 
described in U.S. Patent No. 5,895,753 to Mierendorf et al, which is incorporated by 
reference in its entirety. 

5.9.1.1.2 Assays 

10 Various in vitro assays can be used to identify and verify the ability of a 

compound to modulate premature translation termination and/or nonsense-mediated mKNA 
decay. Multiple in vitro assays can be perforaied simultaneously or sequentially to assess 
the affect of a compound on premature translation tennination and/or nonsense-mediated 
mRNA decay. In a preferred embodiment, the in vitro assays described herein are 

1 5 performed in a high throughput format {e.g. , in microtiter plates). 

In a specific embodiment, the invention provides a method for identifying a 
compound that modulates premature translation termination and/or nonsense-mediated 
mRNA decay, said method comprising: (a) contacting a member of a library of compounds 
with a cell containing a nucleic add sequence comprising a reporter gene, wherein the 

20 reporter gene conrprises a premature stop codon; and (b) detecting the expression of said 
reporter gene, wherein a compound that modulates premature translation termination and/or 
nonsense-mediated niRNA decay is identified if the expression of said reporter gene in the 
presence of a compound is altered relative to a previously determined reference range, or 
the expression of said reporter gene ia the absence of said compound or the presence of an 

25 appropriate control (e.g., a negative control). 

In another embodiment, the invention provides a method for 
identifying a compound that modulates premature translation terxrdnation and/or nonsense- 
mediated mKNA decay, said method comprising: (a) contacting a member of a library of 
compounds with a cell-free extract and a nucleic acid sequoace comprising a reporter gene, 

30 wherein the reporter gene comprises a premature stop codon; and (b) detecting the 

expression of said reporter gene, wherein a compound that modulates premature translation 
termination and/or nonsense-mediated inRNA decay is identified if the expression of said 
reporter gene in the presence of a compound is altered relative to a previously determined 
reference range, or the expression of said reporter gene in the absence of said compound or 



103 



wo 2004/010106 PCT/US2003/023075 
the presence of an appropriate control (e.g., a negative control). In accordance with this 
embodiment, the cell-extract is preferably isolated from cells that have been incubated at 
about O^C to about lO^C and/or an SIO to S30 cell-free extract. 

The alteration in reporter gene expression and/or activity in the reporter gene 
assays relative to a previously determined reference range, or to the expression or activity of 
the reporter gene in the absence of the compoimd or the presence of an appropriate control 

a negative control such as phosphate buffered saline) indicates that a particxilar 
compound modulates premature translation termination and/or nonsense-mediated mKNA 
decay. In particular, an increase in rqjorter gene expression or activity relative to a 
previously determined reference range, or to the expression in the absence of the compound 
or the presence of an appropriate control (e.g., a negative control) may, depending upon the 
parameters of the reporter gene assay, indicate that a particular compound reduces or 
suppresses premature translation termination and/or nonsense-mediated mRNA decay. In 
contrast, a decrease in reporter gene expression or activity relative to a previously 
determined reference range, or to the expression in the absence of the compound or the 
presence of an appropriate control (e.g., a negative control) may, depending upon the 
parameters of the reporter gene-based assay, indicate that a particular con:q)omid enhances 
premature translation termination and/or nonsense-mediated mRNA decay. 

The step of contacting a compound or a member of a hbrary of compounds 
with cell in the reporter gene-based assays described herein is preferably conducted under 
physiologic conditions. In specific embodiment, a compound or a mCTaber of a hbrary of 
compounds is added to the cells in the presence of an aqueous solution. In accordance with 
this embodiment, the aqueous solution may comprise a buffer and a combination of salts, 
preferably approximating or mimicking physiologic conditions. Alternatively, the aqueous 
solution may comprise a buffer, a combmation of salts, and a detergent or a surfactant 
Examples of salts which may be used in the aqueous solution include, but not limited to, 
KCl, NaCl, and/or MgCl2. The optimal concentration of each salt used in the aqueous 
solution is dependent on the cells and compounds used and can be determined using routine 
experimentation. The step of contacting a compound or a member of a hbrary of 
compounds with a cell containing a reporter gene construct and in some circumstances, a 
nucleic acid sequence encoding a regulatory protein, may be performed for at least 0.2 
hours, 0.25 hours, 0.5 hours, 1 hour, 2 hours, 3 hours, 4 hours, 5 hours, 6 hours, 8 hours, 10 
hours, 12 hours, 18 hours, at least 1 day, at least 2 days or at least 3 days* 
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The expression of a reporter gene and/or activity of the protein encoded by 
the reporter gene in the reporter-gene assays may be detected by any technique welUcnown 
to one of skill in the art. The expression of a reporter gene can be readily detected, e.g.^ by 
quantifying the protein and/or KNTA encoded by said gene. Compounds that modulate 
S premature translation termination and/or nonsense-mediated mRNA decay may be 

identified by changes in the gene encoding the premature translation stop codon, Le.^ there 
is readthrough of the premature translation stop codon and a longer gene product is 
detected. If a gene encoding a naturally-occurring premature translation stop codon is used, 
a longer gene product in the presence of a compound that modulates premature translation 

10 termination and/or nonsense-mediated mKNA decay can be detected by any method in the 
art that permits the detection of the longer polypeptide, such as, but not limited to, 
immunological methods. 

Many methods standard in the art can be thus employed, including, but not 
Umited to, immunoassays to detect and/or visualize gene expression (eg.. Western blot, 

15 immunoprecipitation followed by sodium dodecyl sulfate polyacrylamide gel 

electrophoresis (SDS-PAGE), immunocytochemistry, radioimmunoassays, ELISA (enzyme 
linked immunosorbent assay), "sandwich" immunoassaj^, immunoprecipitation assays, 
precipitin reactions, gel difiusion precipitin reactions, immunodifEiision assays, 
agglutination assays, complement-fixation assays, inmiunoradiometric assays, fluorescent 

20 immimoassays, protein A immunoassays, or an epitope tag using an antibody that is specific 
to the polypeptide encoded by the gene of interest) and/or hybridization assays to detect 
gene expression by detecting and/or visualizing respectively mRNA encoding a gene (e.g., 
Norfhem assays, dot blots, in situ hybridization, etc), etc. Preferably, the antibody is 
specific to the C-teraiinal portion of the polypeptide used in an immunoassay. Such assays 

25 are routine and well known in the art (see, e.g., Ausubel et al, eds, 1994, Current Protocols 
in Molecular Biology, Vol. 1, John Wiley & Sons, Inc., New York, which is incorporated 
by reference herein in its entirety). Exemplary immunoassays are described briefly below . 
(but are not intended by way of limitation), 

Immunoprecipitation protocols generally comprise lysing a population of 

30 cells in a lysis buffer such as RIPA buffer (1% NP-40 or Triton X-100, 1% sodium 

deoxycholate, 0.1% SDS, 0.15 MNaCl, 0.01 M sodium phosphate at pH 7.2, 1% Trasylol) 
supplemented with protein phosphatase and/or protease inhibitors (e.^., EDTA, PMSF, 
aprotinin, sodium vanadate), adding the antibody which recognizes the antigen to the cell 
lysate, incubating for a period of time {e.g., 1 to 4 hours) at 40° C, adding protein A and/or 
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protein G sepharose beads to the cell lysate, incubating for about an hour or more at 40° C, 
washing the beads in lysis buffer and resuspending the beads in SDS/sampIe buffer. The 
ability of the antibody to immunoprecipitate a particular antigen can be assessed by, eg., 
western blot analysis. One of skill in the art woxdd be knowledgeable as to the parameters 
5 that can be modified to increase the binding of the antibody to an antigen and decrease the 
background (eg., pre-clearing the cell lysate with sepharose beads). For further discussion 
regarding immunoprecipitation protocols see, e.g., Ausubel et al, eds, 1994, Current 
Protocols in Molecular Biology, Vol. 1, John Wiley & Sons, Inc., New York at 10.16.1. 

Western blot analysis generally comprises preparing protein samples, 

10 electrophoresis of the protein samples in a polyacrylamide gel (eg., 8%- 20% SDS-PAGE 
depending on the molecular weight of the antigen), transferring the protein sample from the 
polyacrylamide gel to a membrane such as nitrocellulose, PVDF or nylon, blockiag the 
membrane in blocking solution (eg., PBS with 3% BSA or non-fat milk), washing the 
membrane in washing buffer (eg., PBS-Tween 20), blocking the membrane with primary 

15 antibody (the antibody which recognizes the antigen) diluted in blocking buffer, washing 
the membrane in washing buffer, blocking the membrane with a secondary antibody (which 
recognizes the primary antibody, eg., an anti-human antibody) conjugated to an enzymatic 
substrate (eg., horseradish peroxidase or alkalme phosphatase) or radioactive molecule 
(eg., ^^P or ^^J) diluted m blocking buffer, washing the membrane in wash buffer, and 

20 detecting the presence of the antigen. One of skill in the art would be knowledgeable as to 
the parameters that can be modified to increase the signal detected and to reduce the 
background noise. For further discussion regarding western blot protocols see, eg., 
Ausubel et al, eds, 1994, Current Protocols in Molecular Biology, Vol. 1, John Wiley & 
Sons, Inc., New York at 10.8.1. 

25 ELISAs comprise preparing antigen, coating the well of a 96 well microtiter 

plate with the antigen, adding a primary antibody (which recognizes the antigen) conjugated 
to a detectable compound such as an enzymatic substrate (e.g., horseradish peroxidase or 
alkaline phosphatase) to the well and incubating for a period of time, and detecting the 
presence of the antigen. In ELISAs the antibody of interest does not have to be conjugated 

30 to a detectable compound; instead, a second antibody (which recognizes the primary 

antibody) conjugated to a detectable confound may be added to the well. Further, instead 
of coating the well with the antigen, the antibody may be coated to the well. In this case, a 
second antibody conjugated to a detectable compound may be added following the addition 
of the antigen of interest to the coated well. One of skill in the art would be knowledgeable 
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as to the parameters that can be modified to increase the signal detected as well as oth^ 
variations of ELISAs known in llie art. For further discussion regarding ELISAs see, e.g., 
Ausubel et al, eds, 1994, Current Protocols in Molecular Biology, Vol. 1, John Wiley & 
Sons, Inc., New York at 1L2.1. 

Methods for detecting the activity of a protein encoded by a reporter gene 
will vary with the reporter gene used. Assays for the various reporter genes are well-known 
to one of skill in the art For example, as described in Section 5.1.L, luciferase, beta- 
galactosidase C*beta-gal"), beta-glucoronidase ("GUS'"), beta-lactamase, chloranq)hemcol 
acetyltransferase ("CAT"), and alkaline phosphatase C*AP") are enzymes that can be 
analyzed in the presence of a substrate and could be amenable to high throughput screening. 
For example, the reaction products of luciferase, beta-galactosidase ('Tjeta-gal"), and 
alkaline phosphatase ("AP") are assayed by changes in hght imaging (e.g., luciferase), 
spectrophotometric absorbance (e.g., beta-gal), or fluorescence (e.g., AP). Assays for 
changes in light output, absorbance, and/or fluorescence are easily adapted for high 
throughput screening. For example, beta-gal activity can be measured with a microplate 
reader. Green fluorescent protein ("GFP") activity can be measured by changes in 
fluorescence. For example, in the case of mutant GFPs that fluoresce at 488 mn, standard 
fluorescence activated cell sorting CTACS") equipment can be used to separate cells based 
vpon GFP activity. 

Changes in mKNA stability of the gene encoding the premature translation 
stop codon can be measured. As discussed above, nonsense-mediated mRNA decay alters 
the stability of an mKNA with a premature translation stop codon so that such mRNA is 
targeted for rapid decay instead of translation. In the presence of a compound that 
modulates premature translation termination and/or nonsense-mediated mRNA decay, the 
stability of the mRNA with the premature translation stop codon is likely altered, i.e., 
stabilized. Methods of measuring changes in steady state levels of mRNA are well-known 
to one of skill in the art. Such methods include, but are not limited to. Northern blots, dot 
blots, solution hybridization, RNase protection assays, and SI nuclease protection assays, 
wherein the steady state levels of the mRNA of interest are measured with an appropriately 
labeled nucleic acid probe. Altematively, methods such as semi-quantitative polymerase 
chain reaction ("PGR") can be used to measure changes in steady state levels of the mRNA 
of interest using the appropriate primers for amplification. 

Alterations in the expression of a reporter gene may be determined by 
comparing the level of expression and/or activity of the reporter gene to a negative control 
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(e.g., PBS or another agent that is known to have no effect on the expression of the reporter 
gene) and optionally, a positive control (e.g., an agent that is known to have an eflFect on the 
expression of the reporter gene, preferably an agent that effects premature translation 
termination and/or nonsense-mediated mRNA decay). Alternatively, alterations in the 
5 e}q)ression and/or activity of a reporter gene may be determined by comparing the level of 
expression and/or activity of the reporter gene to a previously determined reference range. 

5.9.1.2 Other In Vitro Assays 
Where the gene product of interest is involved in cell growth or viability, the 

10 in vivo effect of the lead compound can be assayed by measuring the cell growth or 

viability of the target cell. Such assays can be carried out with representative cells of cell 
types involved in a particular disease or disorder (e.g., leukocytes such as T cells, B cells, 
natural killer cells, macrophages, neutrophils and eosinophils). A lower level of 
proliferation or survival of the contacted cells indicates that the lead compound is effective 

15 to treat a condition in the patient characterized by uncontrolled cell growth. Altematively, 
instead of culturing cells from a patient, a lead compoimd may be screened using cells of a 
tumor or malignant cell line or an endothelial cell line. Specific examples of cell culture 
models include, but are not limited to, for lung cancer, primary rat limg tumor cells (see, 
e.g., Swafford et al., 1997, Mol. Cell. BioL, 17:1366-1374) and large-cell undifferentiated 

20 cancer cell lines (see, e.g., Mabry et al., 1991, Cancer Cells, 3:53-58); colorectal cell lines 
for colon cancer (see, e,g.. Park & Gazdar, 1996, J. Cell Biochem. Suppl. 24:131-141); 
multiple established cell lines for breast cancer (see, e,g,, Hambly et al., 1997, Breast 
Cancer Res. Treat. 43:247-258; Gierthy et al., 1997, Chemosphere 34:1495-1505; and 
Prasad & Church, 1997, Biochem. Biophys, Res. Commun. 232:14-19); a number of well- 

25 characterized cell models for prostate cancer (see, e.g., Webber et al., 1996, Prostate, Part 1, 
29:386-394; Part 2, 30:58-64; and Part 3, 30:136-142 and BouUkas, 1997, Anticancer Res. 
17:1471-1505); for genitourinary cancers, continuous human bladder cancer cell lines (see, 
e.g'., Ribeiro et al., 1997, Int. J. Radiat. Biol. 72:11-20); organ cultures of transitional cell 
carcinomas (see, e.g.. Booth et al., 1997, Lab Invest. 76:843-857) and rat progression 

30 models (see, e.g.. Vet et al., 1997, Biochim. Biophys Acta 1360:39-44); and established cell 
lines for leukemias and lymphomas (see, e^., Drexler, 1994, Leuk. Res. 18:919-927 and 
Tohyama, 1997, Int. J. Hematol. 65:309-317). 

Many assays well-known in the art can be used to assess flie survival and/or 
growth of a patient cell or cell line following exposure to a lead compound; for example. 
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cell proliferation can be assayed by measuring bromodeoxyuridine (BrdU) incorporation 
(see, e.g., Hoshino et al., 1986, Int. J. Cancer 38:369 and Campana et al., 1988, J. Immunol. 
Meth. 107:79) or (3H)-thymidme incorporation (see, e,g., den, 1996, Oncogene 13:1395- 
403 and Jeoung, 1995, J. BioL Chem. 270:18367-73), by direct cell count, by detecting 
5 changes in transcription, translation or activity of knoivn genes such as proto-oncogenes 
(e.g., fos, myc) or cell cycle markers (Rb, cdc2, cyclin A, Dl, D2, D3, E, etc.). The levels 
of such protein and mRNA and activity can be deteraained by any method well known in 
the art. For example, protein can be quantitated by known immunodiagnostic methods such 
as westem blotting or immunoprecipitation using conmiercially available antibodies. 

10 mRNA can be quantitated using methods that are well known and routine in the art, for 
example, using northern analysis, RNase protection, the polymerase chain reaction in 
connection with reverse transcription ("RT-PCR"). CeU viabihty can be assessed by using 
trypan-blue staining or other cell death or viability markers known in the art. In a specific 
embodiment, the level of cellular ATP is measured to determined cell viabihty. 

15 Differentiation can be assessed, for example, visually based on changes in morphology. 



transformation (or progression to malignant phenotype) in vitro. In this embodiment, cells 
with a transformed cell phenotype are contacted with a lead confound, and examined for 
change in characteristics associated with a transformed phenotype (a set of in vitro 

20 characteristics associated with a tumorigenic ability ui vivo), for example, but not limited 
to, colony formation in soft agar, a more rounded cell morphology, looser substratum 
attachment, loss of contact inhibition, loss of anchorage dependence, release of proteases 
such as plasminogen activator, increased sugar transport, decreased serum requirement, or 
expression of fetal antigens, etc. (see, e.g,, Luria et al., 1978, General Virology, 3d Ed., 

25 John Wiley & Sons, New York, pp. 436-446). 



demonstrate the anti-cancer effects of a lead compound- For example, an aspect of the 
formation of a metastatic cancer is the abiUty of a precancerous or cancerous cell to detach 
from primary site of disease and establish a novel colony of growth at a secondary site. The 
30 ability of a cell to invade peripheral sites reflects its potential for a cancerous state. Loss of 
invasiveness can be measured by a variety of techniques known in the art including, for 
example, induction of E-cadherin-mediated cell-cell adhesion. Such E-cadherin-mediated 
adhesion can result in phenotypic reversion and loss of invasiveness (see, e.g., Hordijk et 
al., 1997, Science 278:1464-66). 



The lead compound can also be assessed for its ability to inhibit cell 



Loss of invasiveness or decreased adhesion can also be assessed to 
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Loss of invasiveness can further be examined by inhibition of cell migration. 
A variety of 2"-dimensional and S-dimensional cellidar matrices are commercially available 
(Calbiochem-Novabiochem Corp. San Diego, CA). Cell migration across or into a matrix 
can be examined using microscopy, time-l^sed photography or videography, or by any 
5 method in liie art allowing measurement of cellular migration. In a related embodiment, 
loss of invasiveness is examined by response to hepatocyte growth factor ('HGF"). HGF- 
induced cell scattering is correlated with invasiveness of cells such as Madin-Daifoy canine 
kidney ("MDCK") cells. This assay identifies a cell population that has lost cell scattering 
activity in response to HGF (see, e.g., Hordijk et al., 1997, Science 278:1464-66). 

10 Alternatively, loss of invasiveness can be measured by cell migration 

through a chemotaxis chamber (Neuroprobe/ Precision Biochemicals Inc. Vancouver, BQ. 
In such assay, a chemo-attractant agent is incubated on one side of the chamber (e.g., the 
bottom chamber) and cells are plated on a filter separating the opposite side (e.g., the top 
chamber). In order for cells to pass from the top chamber to the bottom chamber, the cells 

15 must actively migrate through small pores in the filter. Checkerboard analysis of the 
number of cells that have migrated can then be correlated with invasiveness (see e.g., 
Ohnishi, 1993, Biochem. Biophys. Res. Commun.l93:518-25). 

A lead compound can also be assessed for its ability to alter the expression 
of a secondary protein (as determined, e.g. by western blot analysis) or RNA, whose 

20 expression and/or activation is regulated directiy or indirectly by the gene product of a gene 
of interest containing a premature stop codon or a nonsense mutation (as determined, e.g., 
by RT-PCR or northern blot analysis) in cultured cells in vitro using methods which are 
well known in the art. Further, chemical footprinting analysis can be conducted and is well- 
known in the art. 

25 In another embodiment of the invention, the lead compound can be tested in 

a host cell. In such an embodiment, the host cell can enode a nucleic acid with a premature 
translation termination sequence or stop codon. Such nucleic acids can be encoded by a 
number of vehicles, including, but not limited to, recombinant or chimeric vectors, viruses 
or the genome of the host cell. In another embodiment of the invention, the presence of the 

30 gene, containing a premature stop codon or translation termination sequence, causes a 
detectable phenotype in the host cell. Moreover, the effect of lead conipounds on the 
phenotype of the cell can be examined in order to determine suitable candidates that 
modulate premature translation termination fix>m a pool of compoimds. In one 
embodiment, a host cell containmg a gene encoding a premature translation termination 
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sequence or stop codon, exhibits an abnormal phenotype relative to the wild type cell that 
does not encode a gene with a premature stop codon. hi such an embodiment, the effect of 
a compound on the host cell phenotype can be examined in order to determine the effect of 
a lead compoimd on premature translation termination or nonsense mediate mRNA decay. 
5 By way of example and not meant to limit the possible models, host cells, expressing 
mutations in a gene that controls cell cycle or proliferation, e.g., p53, can be exposed to 
various lead compounds in order to determine their effect on cell proliferation. Any lead 
compound that affects the proliferative activity of the host cell is identified as a compound 
that modulates premature translation termination or nonsense mediated mRNA decay. 

10 5.9.2 Animal Models 

Animal model systems can be used to demonstrate the safety and efficacy of 
the lead compounds identified in the nonsense suppression assays described above. The 
lead compounds identified in the nonsense suppression assay can then be tested for 
biological activity using animal models for a disease, condition, or syndrome of interest. 

IS These include animals engineered to contain the target RNA element coupled to a 
functional readout system, such as a transgenic mouse. 

There are a number of methods that can be used to conduct animal model 
studies. Briefly, a compoxmd identified in accordance with the methods of the invention is 
introduced into an animal model so that the effect of the compound on the manifestation of 

20 disease can be determined. The prevention or reduction in the severity, duration or onset of 
a symptom associated with the disease or disorder of the animal model that is associated 
with, characterized by or caused by premature translation termination and/or nonsense 
mediated mRNA decay would indicate that the compound adminstered to the animal model 
had a prophylactic or therapeutic effect. Any method can be used to introduce the 

25 compound into the animal model, including, but not limited to, injection, intravenous 
mfusion, oral ingestion, or inhalation. In a preferred embodiment, transgenic hosts are 
constructed so that the animal genome encodes a gene of interest with a premature 
translation termination sequence or stop codon. In such an embodiment, the gene, containg 
a premature translation termination sequence or stop codon, would not encode a full length 

30 peptide firom a transcribed mRNA The adminsitration of a compound to the animal model, 
and the expression of a full length protein, polypeptide or peptide, for example, 
corresponding to the gene contaiiung a premature stop codon would indicate that the 
compound modulates premature translation termination. Any method known in the art, or 
described herein, can be used to determine if the stop codon was modulated by the 

111 




wo 2004/010106 PCT/US2003/023075 
compound. In another embodiment, the animal host genome encodes a native gene 
containing a premature stop codon. In another embodiment of the invention, the animal 
host is a natural mutant^ te.^ natively encoding a gene with a premature stop codon. For 
example, the animal can be a model for cystic fibrosis wherein the animal genome contains 
5 a natural mutation that incotporates a premature stop codon or translation termination 
sequence. 

Examples of animal models for cystic fibrosis include, but are not limited to, 
cftr(-/-) mice (see, e.g., Freedman et aL, 2001, Gastroenterology 121(4):950"7), 
cftr(tmlHGU/tmlHGU) mice (see, e.g., Bemhard et al, 2001, Exp Lung Res 27(4):349- 

10 66), CFTR-deficient mice with defective cAMP-mediated Cl(-) conductance (see, e.g., 
Stotland et al, 2000, Pediatr Puhnonol 30(5):413-24), and C57BL/6- 
Cftr(mlUNC)/Cftr(mlUNC) knockout mice (see, e.g., Stotland et al, 2000, Pediatr 
Puhnonol 30(5):413-24). 

Examples of animal models for muscular dystrophy include, but are not 

15 limited to, mouse, hamster, cat, dog, and C. elegans. Examples of mouse models for 

muscular dystrophy include, but are not limited to, the dy-/- mouse (see, e,g., Connolly et 
aL, 2002, J Neuroimmunol 127(l-2):80-7), a muscular dystrophy with myositis (mdm) 
mouse mutation (see, e.g,, Garvey et aL, 2002, Genomics 79(2):146-9), the mdx mouse 
(see, eg., Nakamura et aL, 2001, Neuromuscul Disord 1 l(3):2Sl-9), the utrophin- 

20 dystrophin knockout (dko) mouse (see, e.g., Nakamura et aL, 2001, Neuromuscul Disord 
1 1(3):251-9), the dy/dy mouse (see, e.g., Dubowitz et al, 2000, Neuromuscul Disord 10(4- 
5):292-8), the mdx(Cv3) mouse model (see, eg., Fillers et al, 1999, Laryngoscope 
109(8):1310-2), and tiie myotonic ADR-MDX mutant mice (see, e.g., Kramer et al, 1998, 
Neuromuscul Disord 8(8):542-50). Examples of hamster models for muscular dystrophy 

25 include, but are not Umited to, sarcoglycan-deficient hamsters (see, e.g., Nakamura et al., 
2001, Am J Physiol Cell Physiol 281(2):C690-9) and the BIO 14.6 dystrophic hamster (see, 
eg., Schlenker & Burbach, 1991, J Appl Physiol 71(5): 1655-62). An example of a feline 
model for muscular dystrophy includes, but is not limited to, the hypertrophic feline 
muscular dystrophy model (see, eg., Gaschen & Burgunder, 2001, ActaNeuropathol (Berl) 

30 101(6):591-600). Canine models for muscular dystrophy include, but are not Umited to, 
golden retriever muscular dystrophy (see, eg., Fletcher et aL, 2001, Neuromuscul Disord 
1 1(3):239-43) and canine X-linked muscular dystrophy (see, eg., Valentine et al, 1992, 
Am J Med Genet 42(3):352-6). Examples of C. elegans models for muscular dystrophy are 
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described in Chamberiain & Benian, 2000, Curr Biol 10(21):R795-7 and Culette & Sattelle, 

2000, Hum Mol Genet 9(6):869-77. 

Examples of animal models for familial hypercholesterolemia include, but 
are not limited to, mice lacking functional LDL receptor genes (see, e.g., Aji et aL, 1997, 
5 Circulation 95(2):430-7), Yoshida rats (see, eg., Fantappie et aL, 1992, Life Sci 

50(24):1913-24), the JC3l:LA-cp rat (see, Richardson et aL, 1998, Atherosclerosis 
138(l):135-46), swine (see, eg., Hasler-Rapacz et aL, 1998, Am J Med Genet 76(5):379- 
86), and the Watanabe heritable hyperlipidaemic rabbit (see, eg:, Tsutsumi et aL, 2000, 
Arzneimittelforschung 50(2):118-21; Harsch et aL, 1998, Br JPhaimacol 124(2):227-82; 

10 and Tanaka et aL, 1995, Atherosclerosis 1 14(l):73-82). 

An example of an animal model for himian cancer in general includes, but is 
not limited to, spontaneously occurring tumors of companion animals (see, eg., Vail & 
MacEwen, 2000, Cancer Invest 18(8):781-92). Examples of animal models for lung cancer 
include, but are not limited to, lung cancer animal models described by Zhang & Roth 

15 (1994, In Vivo 8(5):755-69) and a transgenic mouse model with disnq>ted p53 function 
(see, eg., Morris et aL, 1998, J La State Med Soc 150(4): 179-85). An example of an 
animal model for breast cancer includes, but is not limited to, a transgenic mouse that 
overexpresses cychn Dl (see, eg., Hosokawa et aL, 2001, Transgenic Res 10(5):471-8). 
An example of an animal model for colon cancer includes, but is not limited to, a TCRbeta 

20 and p53 double knockout mouse (see, eg., Kado et aL, 2001, Cancer Res 61(6):2395-8). 
Examples of animal models for pancreatic cancer include, but are not limited to, a 
metastatic model of Panc02 murine pancreatic adenocarcinoma (see, eg, Wang et aL, 

2001, Int J Pancreatol 29(l):37-46) and nu-nu mice generated in subcutaneous pancreatic 
tumours (see, eg, Ghaneh et aL, 2001, Gene Ther 8(3): 199-208). Examples of animal 

25 models for non-Hodgkin's lymphoma include, but are not limited to, a severe combined 

immunodeficiency ("SCED") mouse (see, eg, Bryant et aL, 2000, Lab Invest 80(4):553-73) 
and an IgHmu-HOXll transgenic mouse (see, eg. Hough et aL, 1998, Proc Natl Acad Sci 
USA 95(23):13853-8). An example of an animal model for esophageal cancer includes, but 
is not limited to, a mouse transgenic for the himian papillomavirus type 16 E7 oncogene 

30 (see, eg, Berber et aL, 1996, J Virol 70(3): 1873-81). Examples of animal models for 

colorectal carcinomas include, but are not limited to. Ape mouse models (see, eg, Fodde & 
Smits, 2001, Trends Mol Med 7(8):369-73 and Kuraguchi et aL, 2000, Oncogene 
19(50):5755-63). An example of an animal model for neurofibromatosis includes, but is 
not limited to, mutant NFl mice (see, eg., Cichowski et aL, 1996, Semin Cancer Biol 
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7(5):291 -8). Examples of animal models for retinoblastoma include, but are not limited to, 
transgenic mice that expression the simian virus 40 T antigen in the retina (see, e.g., Howes 
et al, 1994, Invest Ophthahnol Vis Sci 35(2):342-51 and Windle et ah 1990, Nature 
343(6259):665-9) and inbred rats (see, e.g., Nishida et a/., 1981, Curr Eye Res l(l):53-5 
5 and Kobayashi et al, 1982, Acta Neuropathol (Berl) 57(2-3):203-8). Examples of animal 
models for Wilm's tumor include, but are not limited to, a WTl knockout mice (see, e.g.y 
Schamhorst et al, 1997, Cell Growth Differ 8(2): 133-43), a rat subline with a high 
iucidence of neuphroblastoma (see, e.g., Mesfin & Breech, 1996, Lab Aoim Sci 46(3):321- 
6), and a Wistar/Furth rat with Wilms* tumor (see, e,g.. Murphy et aL, 1987, Anticancer 
10 Res7(4B):717-9). 

Examples of animal models for retinitis pigmentosa include, but are not 
limited to, the Royal College of Surgeons C*RCS") rat (see, e.g., Vollrath et al, 2001, Proc 
Natl Acad Sci USA 98(22); 12584-9 and Hanitzsch et al, 1998, Acta Anat (Basel) 162(2- 

3) :119-26), a rhodopsin knockout mouse (see, eg, Jaissle et aL, 2001, Invest Ophthalmol 
15 Vis Sci 42(2):506-13), and Wag/Rij rats (see, eg, Lai et al, 1980, Am J Pathol 98(1):281- 

4) . 

Examples of animal models for cirrhosis include, but are not limited to, 
CCU-exposed rats (see, e.g., Kloehn et a/., 2001, Horm Metab Res 33(7):394-401) and 
rodent models instigated by bacterial cell components or colitis (see, e,g,, Vierling, 2001, 

20 Best Pract Res Clin Gastroenterol 15(4):591-610). 

Examples of animal models for hemophilia include, but are not limited to, 
rodent models for hemc^hilia A (see, e.g,, Reipert et aL, 2000, Thromb Haemost 84(5):826- 
32; Jarvis et aL,. 1996, Thromb Haemost 75(2):318-25; and Bi et al, 1995, Nat Genet 
10(1):1 19-21), canine models for hemophilia A (see, e.g., Gallo-Penn et al, 1999, Hum 

25 Gene Ther 10(ll):1791-802 and Connelly et al, 1998, Blood 91(9);3273-81), murine 

models for hemophilia B (see, eg, Snyder et al, 1999, Nat Med 5(l):64-70; Wang et al, 
1997, Proc Natl Acad Sci USA 94(21):1 1563-6; and Fang et al, 1996, Gene Ther 3(3):217- 
22), canine models for hemophiha B (see, eg. Mount et al, 2002, Blood 99(8):2670-6; 
Snyder et al, 1999, Nat Med 5(l):64-70; Fang et al, 1996, Gene Ther 3(3):217-22); and 

30 Kay et al, 1994, Proc Natl Acad Sci USA 91(6):2353-7), and a rhesus macaque model for 
hemophilia B (see, eg, Lozier et al, 1999, Blood 93(6):1875-81). 

Examples of animal models for von Willebrand disease include, but are not 
limited to, an inbred mouse strain RUIS/J (see, e,g, Nichols et al, 1994, 83(1 1):3225-31 
and Sweeney et al, 1990, 76(ll):2258-65), rats injected with botrocetin (see, eg, Sanders 
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et al, 1988, Lab Invest 59(4):443-52), and porcine models for von Willebrand disease (see, 
Nichols et al, 1995, Proc Natl Acad Sci USA 92(7):2455-9; Johnson & Bowie, 1992, J 
Lab Clin Med 120(4):553-8); and Brinkhous et aL, 1991, Mayo Clin Proc 66(7):733-42). 

Examples of animal models for p-thalassemia include, but are not limited to, 
5 murine models with mutations in globin genes (see, e.g., Lewis et aL, 1998, Blood 

91(6):2152-6; Raja et aL, 1994, Br J HaCTiatol 86(l):156-62; Popp et al, 1985, 445:432-44; 
and Skow et al, 1983, Cell 34(3): 1043-52). 

Examples of animal models for kidney stones include, but are not limited to, 
genetic hypercalciuric rats (see, e,g,^ Bushinsky et a/., 1999, Kidney Iht 55(l):234-43 and 
10 Bushinsky et ah, 1995, Kidney Iht 48(6):1705-13), chemically treated rats (see, e.g., Grases 
et a/., 1998, Scand J Urol Nephrol 32(4):261-5; Burgess et al, 1995, Urol Res 23(4):239- 
42; Kumar et al, 1991, J Urol 146(5):1384-9; Okada et al, 1985, Hinyokika Kiyo 
31(4):565-77; and Bluestone et al, 1975, Lab Invest 33(3):273-9), hyperoxaluric rats (see, 
e.g,, Jones et al., 1991, J Urol 145(4):868-74), pigs with unilateral retrograde flexible 
15 nephroscopy (see, e.g., Seifinah et al, 2001, 57(4):832-6), and rabbits with an obstructed 
upper urinary tract (see, e.g., Itatani et al, 1979, Invest Urol 17(3):234-40), 

Examples of animal models for ataxia-telangiectasia include, but are not 
limited to, murine models of ataxia-telangiectasia (see, Barlow et aL^ 1999, Proc Natl 
Acad Sci USA 96(17):9915-9 and Inoue et al, 1986, Cancer Res 46(8):3979-82). A mouse 
20 model was generated for ataxia-telangiectasia using gene targeting to generate mice fbat did 
not express the Atm protein (see, e,g,, Elson et aL, 1996, Proc. Nat. Acad. Sci. 93: 13084- 
13089). 

Examples of animal models for lysosomal storage diseases include, but are 
not limited to, mouse models for mucopolysaccharidosis type VII (see, e,g. Brooks et al, 

25 2002, Proc Natl Acad Sci USA. 99(9):6216-21; Monroy et al, 2002, Bone 30(2):352-9; 
Vogler et al, 2001, Pediatr Dev Pathol. 4(5):421-33; Vogler et al, 2001, Pediatr Res. 
49(3):342-8; and Wolfe et al, 2000, Mol Ther. 2(6):552-6), a mouse model for 
metachromatic leukodystrophy (see, e.g., Matzner et al, 2002, Gene Ther. 9(l):53-63), a 
mouse model of Sandhoff disease (see, e.g., Sango et al, 2002, Neuropathol Appl 

30 NeurobioL 28(l):23-34), mouse models for mucopolysaccharidosis type HI A (see, e.g., 
Bhattacharyya et al, 2001, Glycobiology 1 1(1):99-10 and Bhaumik et al, 1999, 
Glycobiology 9(12):1389-960, arylsulfatase A (ASA)-deficient mice (see, e.g., D*Hooge et 
al, 1999, Brain Res. 847(2):352-6 and D'Hooge et al, 1999, Neurosci Lett. 273(2):93-6); 
mice with an aspartylglucosaminuria mutation (see, e.g., Jalanko et al., 1998, Hum Mol 
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Genet. 7(2):265-72); feline models of mucopolysaccharidosis type VI (see, e.g., Crawley et 
al, 1998, J Clin Invest. 101(1):109-19 and Norrdin et al., 1995, Bone 17(5):485-9); a feline 
model of Niemann-Pick disease type C (see, March et aL, 1997, Acta Neuropathol 
^erl). 94(2): 164-72); acid sphingomyelinase-deficient mice (see, e,g., Otterbach & Stoffel, 
5 1995, Cell 8 1(7): 1053-6), and bovine mannosidosis (see, eg.. Jolly et al., 1975, Birth 
Defects Orig Artie Ser. 11(6):273.8). 

Examples of animal models for tuberous sclerosis (*TSC") include, but are 
not limited to, a mouse model of TSCl (see, e.g., Kwiatkowski et al, 2002, Hum Mol 
Genet. ll(5):525-34), a Tscl (TSCl homologue) knockout moxise (see, e.g., Kobayashi et 
10 al, 2001, Proc Natl Acad Sci USA. 2001 Jul 17;98(15):8762-7), a TSC2 gene 

mutant(Eker) rat model (see, e.g., Hino 2000, Nippon Rinsho 58(6): 1255-61; Mizuguchi et 
al, 2000, J Neuropathol Exp Neurol. 59(3):188-9; and Hino et al, 1999, Prog Exp Tumor 
Res. 35:95-108); and Tsc2(+/-) mice (see, e.g., Qnda et al., 1999, J Clin Invest. 
104(6):687-95). 

15 5.9.3 Toxicitv 

The toxicity and/or efficacy of a compound identified in accordance with the 
invention can be detennined by standard pharmaceutical procedures in cell cultures or 
experimental animals, e.g., for determining the LD50 (the dose lethal to 50% of the 
population) and the ED50 (the dose therapeutically effective in 50% of the population). 

20 Cells and cell lines that can be used to assess the cytotoxicity of a compound identified in 
accordance with the invention include, but are not limited to, peripheral blood mononuclear 
cells (PBMCs), Caco-2 cells, and Huh7 cells. The dose ratio between toxic and therapeutic 
effects is the therapeutic index and it can be expressed as the ratio LD50/BD50. A compound 
identified in accordance with the invention that exhibits large therapeutic indices is 

25 preferred. While a compound identified in accordance with the invention that exhibits toxic 
side effects may be used, care should be taken to design a delivery system that targets such 
agents to the site of affected tissue in order to minimize potential damage to uninfected cells 
and, thereby, reduce side efifects. 

The data obtained fi-om the cell culture assays and animal studies can be used 

30 in formulating a range of dosage of a compound identified in accordance with the invention 
for use in humans. The dosage of such agents Ues preferably within a range of circulating 
concentrations that include the ED50 with little or no toxicity. The dosage may vary within 
this range depending upon the dosage form employed and the route of administration 
utilized. For any agent used in the method of the invention, the therapeutically effective 
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dose can be estimated initially from cell culture assays. A dose may be formulated in 
animal models to achieve a circulating plasma concentration range that includes the IC50 
{Le.^ the concentration of the compound that achieves a half-maximal inhibition of 
symptoms) as determined in cell culture. Such information can be used to more accurately 
5 determine useful doses in humans. Levels in plasma may be measured, for example, by 
hig^ performance liquid chromatography. 

5.10 Design of Coneeners or Analogs 

The compounds which display the desired biological activity can be used as 
lead compounds for the development or design of congeners or analogs having useful 

10 pharmacological activity. For example, once a lead compound is identified, molecular 
modeling techniques can be used to design variants of the compound that can be more 
effective. Examples of molecular modeling systems are the CHARM and QUANTA 
programs (Polygen Corporation, Waltham, MA). CHARM performs the energy 
minimization and molecular dynamics fimctions. QUANTA perfonns the constraction, 

IS graphic modelling and analysis of molecular structure. QUANTA allows interactive 
construction, modification, visualization, and analysis of the behayior of molecules with 
each other. 

A number of articles review computer modeling of drugs interactive with 
specific proteins, such as Rotivinen et al., 1988, Acta Pharmaceutical Fennica 97:159-166; 

20 Ripka, 1998, New Scientist 54-57; McKinaly & Rossmann, 1989, Aimu. Rev. Pharmacol. 
Toxiciol. 29:1 1 1-122; Perry & Davies, OSAR: Quantitative Structure-Activity 
Relationships in Drug Design pp. 189-193 (Alan R. liss. Inc. 1989); Lewis & Dean, 1989, 
Proc. R. Soc. Lond, 236:125-140 and 141-162; Askew et al., 1989, J. Am. Chem. Soc. 
Ill :1082-1090. Other computer programs that screen and graphically depict chemicals are 

25 available from companies such as BioDesign, Inc. (Pasadena, California), Allelix, Inc. 

(Mississauga, Ontario, Canada), and Hypercube, Inc. (Cambridge, Ontario). Although these 
are primarily designed for application to drugs specific to particular proteins, they can be 
adapted to design of drugs specific to any identified region. The analogs and congeners can 
be tested for binding to translational machinery using assays well-known in tiie art or 

30 described herein for biologic activity. Alternatively, lead compounds with litfle or no 
biologic activity, as ascertained in the screen, can also be used to design analogs and 
congeners of the compoimd that have biologic activity. 

5,11 Uses of Compounds to Prevent/Treat a Disorder 
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The present invention provides methods of preventing, treating, managing or 
ameliorating a disorder or one or more symptoms thereof, said methods comprising 
administering to a subject in need thereof one or more compounds identified in accordance 
with the methods of the invention or a pharmaceutically acceptable salt thereof. In 
5 particular, the present invention provides methods of preventing, treating, managing or 
ameliorating a disorder associated with premature translation termination and/or nonsense- 
mediated mRNA decay, or one or more symptoms thereof, said methods comprising 
administering to a subject in need thereof one or more compounds identified in accordance 
with the mefliods of the invention or a pharmaceutically acceptable salt tiiereof. Examples 

10 of diseases associated wilh, characterized by or caused by associated with premature 

translation termination and/or nonsense-mediated mRNA decay include, but are not limited 
to, cystic fibrosis, muscular dystrophy, heart disease, lung cancer, breast cancer, colon 
cancer, pancreatic cancer, non-Hodgldn's lymphoma, ovarian cancer, esophageal cancer, 
colorectal carcinomas, neurofibromatosis, retinoblastoma, Wilm's tumor, retinitis 

15 pigmentosa, coUagen disorders, cirrhosis, Tay-Sachs disease, blood disorders, kidney 
stones, ataxia-telangiectasia, lysosomal storage diseases, and tuberous sclerosis. See 
Sections 5.8 and 6.5 for additional non-limiting examples of diseases and genetic disorders 
which can be prevented, treated, managed or ameliorated by administering one or more of 
the compounds identified in accordance with the methods of the invention or a 

20 pharmaceutically acceptable salt thereof. Genes that contain one or more nonsense 
mutations that are potentially involved in causing disease are presented in table form 
according to chromosome location in Example 6.5 infra. 

In a preferred embodiment, it is first determined that the patient is suffering 
firom a disease associated with premature translation termination and/or nonsense-mediated 

25 mRNA decay before administering a compound identified in accordance with the invention 
or a combination therapy described herein. In a preferred embodiment, the DNA of the 
patient can be sequenced or subject to Southern Blot, polymerase chain reaction (PGR), use 
of the Short Tandem Repeat (STR), or polymorphic length restriction firagments (RFLP) 
analysis to determine if a nonsense mutation is present in the DNA of the patient 

30 Altematively, it can be determined if altered levels of the protein with the nonsense 
mutation are expressed in the patient by westem blot or other immunoassays. Such 
mefliods are well known to one of skill in the art. 

In one embodiment, flie invention provides a method of preventing, treating, 
managing or ameliorating a disorder or one or more symptoms thereof said method 
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comprising adnumstering to a subject in need thereof a dose of a prophylactically or 
therapeutically effective amount of one or more compounds IdentiiBed in accordance with 
the methods of the invention. In another embodiment, a compound identified in accordance 
with the methods of the invention is not administered to prevent, treat, or ameliorate a 
5 disorder or one or more symptoms thereof, if such compound has been used previously to 
prevent, treat, manage or ameliorate said disorder. In a more specific embodiment of the 
invention, disorders that can be prevented, managed and/or treated with the compounds of 
the invention, include, but are not limited to, disorders that are associated with, 
characterized by or caused by premature translation termination and/or nonsense mediated 
10 mRNA decay. 

The invention provides methods of preventing, treating, managing or 
ameliorating a disorder or one or more symptoms thereof, said methods comprising 
administering to a subject in need thereof one or more of the compoimds identified utilizing 
the screening methods described herein or a phaxmaceutically acceptable salt thereof, and 

15 one or more other therapies (e,g,, prophylactic or therapeutic agents). In particular, the 
invention provides methods of preventing, treating, managing or ameliorating a disorder 
associated with, characterized by or caused by premature translation termination and/or 
nonsense mediate mRNA decay, or one or more symptoms thereof, said methods 
comprising administering to a subject in need thereof one or more of the compounds 

20 identified utilizing the screening methods described herein or a pharmaceutically acceptable 
salt thereof, and one or more other therapies (e.g., prophylactic or therapeutic agents). 
Preferably, the other ther^ies are currently being used, have been used or are known to be 
usefiil in the prevention, treatment, management or amelioration of said disorder or a 
symptom thereof. Non-Umiting examples of such therapies are in Section 5.11.1 infra. 

25 The therapies (eg., prophylactic or therapeutic agents) or the combination 

therapies of the invention can be administered sequentially or concurrently. In a specific 
embodiment, the combination therapies of the invention comprise a compound identified in 
accordance with the invention and at least one other therapy that has the same mechanism 
of action as said compound. In another specific embodiment, the combination therapies of 

30 the invention comprise a compound identified in accordance with the methods of the 

invention and at least one other therapy prophylactic or therapeutic agent) which has a 
different mechanism of action than said compound. The combination therapies of tiie 
present invention improve the prophylactic or therapeutic effect of a compound of the 
invention by functioning together with the compound to have an additive or synergistic 
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effect. The combination therapies of the present invention reduce the side eJBfects associated 
with the therapies (e.g., prophylactic or therapeutic agents). 

The prophylactic or therapeutic agents of the combination therapies can be 
administered to a subject in the same pharmaceutical composition. Alternatively, the 
S prophylactic or ther^eutic agents of the combination therapies can be administered 
concurrently to a subject in separate pharmaceutical compositions. The prophylactic or 
thorapeutic agents may be administered to a subject by the same or different routes of 
administration. 

In a specific embodiment, a pharmaceutical composition comprising one or 

10 more compounds identified in a screening assay described herein is administered to a 
subject, preferably a human, to prevent, treat, manage or ameliorate a disorder (in 
particular, a disorder associated with, characterized by or caused by premature translation 
termination and/or nonsense mediated mRNA decay) or one or more symptoms thereof. In 
accordance with the invention, the pharmaceutical composition may also comprise one or 

15 more other prophylactic or therapeutic agents. Preferably, such prophylactic or theapeutic 
agents are currently being used, have been used or are known to be useM in the prevention, 
treatment, management or amelioration of a disorder (in particular, a disorder associated 
with, characterized by, or caused by premature translation termination or nonsense- 
mediated mRNA decay) or one or more symptoms thereof. 

20 A compound identified in accordance with the methods of the inveation may 

be used as a first, second, third, fourth or fifth line of therapy for a disorder (in particular, a 
disorder associated with, characterized by or caused by premature translation termination 
and/or nonsense-mediated mRNA decay). The invention provides methods for treating, 
managing or ameliorating a disorder (in particular, a disorder associated with, characterized 

25 by or caused by premature translation termination and/or nonsense-mediated mRNA decay) 
or one or more symptoms thereof in a subject refractory to conventional therapies for such 
disorder, said methods comprisiug administering to said subject a dose of a prophylactically 
or therapeutically effective amount of a compound identified in accordance with the 
methods of the invention. In particular, a disorder may be determined to be refractory to a 

30 therapy when at least some significant portion of the disorder is not resolved in response to 
the therapy. Such a determination can be made either in vivo or in vitro by any method 
known in the art for assaying the effectiveness of a therapy on a subject, using the art- 
accepted meanings of "refractory" in such a context In a specific embodiment, a disorder 
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is refractory where the number of symptoms of the disorder has not been significantly 
reduced, or has increased. 

The invention provides methods for treating, managing or ameliorating one 
or more symptoms of a disorder (in particular, a disorder associated with, characterized by 
5 or caused by premature translation temiination and/or nonsense-mediated mRNA decay) in 
a subject refractory to existing single agent ther{5)ies for such disorder, said methods 
comprising administering to said subject a dose of a prophylactically or th^peutically 
effective amount of a compound identified in accordance with the methods of the invention 
and a dose of a prophylactically or therapeutically effective amount of one or more other 

10 therapies (e.g. , prophylactic or therapeutic agents). The invention also provides methods 
for treating or managing a disorder (in particular, a disorder associated with, characterized 
by or caused by premature translation termination and/or nonsense-mediated mRNA decay) 
by administering a compound identified in accordance with the methods of the invention in 
combination with any other therapy (e.^., radiation therapy, chemotherapy or surgery) to 

15 patients who have proven refractory to other therapies but are no longer on these therapies. 
The invention also provides methods for the treatment or management of a patient having 
disorder (in particular, a disorder associated with, characterized by or caused by premature 
translation termination and/or nonsense-mediated mRNA decay) and said patient is 
immunosuppressed or immunocompromised by reason of having previously xmdergone 

20 other therapies. Further, tiie invention provides methods for preventing the recurrence of a 
disorder (in particular, a disorder associated with, characterized by or caused by premature 
translation termination and/or nonsense-mediated mRNA decay) such as, e.g., cancer in 
patients that have been undergone therapy and have no disease activity by administering a 
compound identified in accordance with the methods of the invention. 

25 In addition to the use of the compounds identified in accordance with the 

invention for the prevention, treatment, management or ameUoration of a disorder or a 
symptom thereof, the compounds may be used in vitro to modulate the expression of 
particular genes of interest, for example, the compounds may be used to increase or 
decrease the expression of a particular gene of interest when conducting in vitro studies. 

30 5-11.1 Other Therapies 

The present invention provides methods of preventing, treating, managing or 
ameliorating a disordo* (in particular, a disorder associated with, characterized by or caused 
by premature translation termination and/or nonsense-mediated mRNA decay), or one or 
more symptoms thereof, said methods comprising administering to a subject in need thereof 
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one or more compounds identified in accordance with the methods of the invention or a 
pharmaceutically acceptable salt thereof^ and one or more other therapies (e.g., prophylactic 
or therapeutic agents). Any therapy (e.g., chemotherapies, radiation therapies, hormonal 
therapies, and/or biological therapies/immunotherapies) which, is known to be useful, or 
5 which has been used or is currently being used for the prevention, treatment, management 
or amelioration of a disorder (in particular, a disorder associated with, characterized by or 
caused by premature translation termination and/or nonsense-mediated mRNA decay) or 
one or more symptoms thereof can be used in combmation with a compound identified in 
accordance with the methods of the invention. Examples of therapeutic or prophylactic 

10 agents which can be used in combination with a compound identified in accordance with 
the invention include, but are not limited to, peptides, polypeptides, fiision proteins, nucleic 
acid molecules, small molecules, mimetic agents, synthetic drugs, inorganic molecules, and 
organic molecules. 

Proliferative disorders associated with, characterized by or caused by 

1 5 premature translation termination and/or nonsense-mediated mRNA decay can be 

prevented, treated, managed or ameliorated by administering to a subject in need thereof 
one or more of the compounds identified in accordance with the methods of the invention, 
and one or more other therapies for prevention, treatment, management or amelioration of 
said disorders or a symptom thereof. Examples of such therapies include, but are not 

20 limited to, angiogenesis inhibitors, topoisomerase inhibitors, immunomodulatory agents 

(such as chemotherapeutic agents) and radiation ther^y. Angiogenesis inhibitors (i.e., anti- 
angiogenic agents) include, but are not limited to, angiostatin (plaaninogen fragment); 
antiangiogenic antithrombin HI; angiozyme; ABT-627; Bay 12-9566; Benefin; 
Bevacizumab; BMS-275291; cartilage-derived inhibitor (GDI); CAI; CD59 complement 

25 fragment; CEP-7055; Col 3; combretastatin A-4; endostatin (collagen XVm fragment); 
fibronectin fragment; Gro-beta; Halofiiginone; Hepaiinases; Heparin hexasaccharide 
fragment; HMV833; hxmian chorionic gonadotropin (hCG); IM-862; Interferon 
alpha^eta/gamma; Interferon inducible protein (IP- 10); Interleukui-12; Kringle 5 
(plasminogen fragment); Marimastat; Metalloproteinase inhibitors (TIMPs); 2- 

30 methoxyestradiol; MMI 270 (CGS 27023A); MoAb MC-lCl 1; Neovastat; NM-3; Panzem; 
PI-88; Placental ribonuclease inhibitor; plasminogen activator inhibitor; platelet factor-4 
0PF4); Prinomastat; Prolactin 16kD fragment; Proliferin-related protein (PRP); PTK 
787/ZK 222594; retinoids; solimastat; squalamine; SS 3304; SU 5416; SU6668; SU11248; 
tetrahydrocordsol-S; tetrathiomolybdate; thalidomide; thrombospondin-1 (TSP-1); TNP- 
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470; transforming growth factor-beta; vasculostatin; vasostatin (caJreticulin fragment); 

ZD6126; ZD 6474; famesyl transferase inhibitors (FTI); and bisphosphonates. In a specific 

embodiment, anti-angiogenic agents do not include antibodies or fragments thereof that 

immimospecifically bind to integrin ovft- 
5 Specific examples of propylactic or therapeutic agents which can be used in 

accordance with the methods of the invention to prevent, treat, manage or ameliorate a 

proliferative disorder associated with, characterized by or caused by premature translation 

termination and/or nonsense-mediated mRNA decay, or a symptom thereof include, but not 

limited to: acivicin; aclarubicin; acodazole hydrochloride; acronine; adozelesin; 
10 aldesleukin; altretamine; ambomyciti; ametantrone acetate; aminoglutethimide; amsacrine; 

anastrozole; anthramycin; asparaginase; asperlin; azacitidine; azetepa; azotomycin; 

batimastat; benzodepa; bicalutamide; bisantrene hydrochloride; bisnafide dimesylate; 

bizelesm; bleomycin sulfate; brequinar sodium; bropirimine; busulfan; cactinomycin; 

calusterone; caracemide; carbetimer; carboplatia; carmustiQe; carubicin hydrochloride; 
15 carzelesin; cedefingol; chlorambucil; cirolemycin; cisplatin; cladribine; crisnatol mesylate; 

cyclophosphamide; cytarabine; dacarbazine; dactinomycin; daunorubicin hydrochloride; 

decitabine; dexormaplatin; dezaguanine; dezaguanine mesylate; diaziquone; docetaxel; 

doxorubicin; doxorubicin hydrochloride; droloxifene; droloxifene citrate; dromostanolone 

propionate; duazomycin; edatrexate; eflomithine hydrochloride; elsamitmcin; enloplatin; 
20 enpromate; epipropidine; epimbicin hydrochloride; erbulozole; esombicin hydrochloride; 

estramustine; estramustine phosphate sodium; etanidazole; etoposide; etoposide phosphate; 

etoprine; fadrozole hydrochloride; fazarabine; fenretinide; floxuridine; fludarabine 

phosphate; fluorouracil; flurocitabine; fosquidone; fostriecin sodium; gemcitabine; 

gemcitabine hydrochloride; hydroxyurea; idarubicin hydrochloride; ifosfamide; ihnofosine; 
25 interleukin U (including recombinant interlexddn U, or rIL2), interferon alpha-2a; interferon 

alpha-2b; interferon alpha-nl ; interferon alpha-n3; interferon beta-X a; interferon gamma-I 

b; iproplatin; irinotecan hydrochloride; lanreotide acetate; letrozole; leuprolide acetate; 

liarozole hydrochloride; lometrexol sodixmi; lomustine; losoxantrone hydrochloride; 

masoprocol; maytansine; mechlorethamine hydrochloride; megestrol acetate; melengestrol 
30 acetate; melphalan; menogaril; merc^topurine; methotrexate; methotrexate sodium; 

metoprine; meturedepa; mitindomide; mitocarcin; mitocromin; mitogillin; mitomalcii^ 

mitomycin; mitosper, mitotane; mitoxantrone hydrochloride; mycophenolic acid; 

nocodazole; nogalamycin; ormaplatin; oxisuran; paclitaxel; pegaspargase; peliomycin; 

pentamustine; peplomycin sulfate; perfosfamide; pipobroman; piposulfan; pkoxantrone 
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hydrochloride; plicamycin; plomestane; porfimer sodium; porfiromycin; prednimxistine; 
procarbazdne hydrochloride; puromycin; puromycin hydrochloride; pyrazofurin; riboprme; 
rogletimide; sajSngol; safingol hydrochloride; semustine; simtrazene; sparfosate sodium; 
sparsomycin; spirogermamum hydrochloride; spiromustine; spiroplatin; streptonigrin; 
streptozocin; sulofenur; talisomycin; tecogalan sodium; tegafiir; teloxantrone hydrochloride; 
temoporfin; teniposide; teroxirone; testolactone; thiamiprine; thioguanine; thiotepa; 
tiazofurin; tirapazamine; toremifene citrate; trestolone acetate; triciribine phosphate; 
trimetrexate; trimetrexate glucuronate; triptorelin; tubulozole hydrochloride; uracil mustard; 
uredepa; vapreotide; verteporfin; vinblastine sulfate; vincristine sulfete; vindesine; 
vindesine sulfate; vinepidine sulfate; vinglycinate sulfate; vinleurosine sulfate; vinorelbine 
tartrate; vinrosidine sulfate; vinzolidine sulfate; vorozole; zeniplatin; zinostatin; zorubicin 
hydrochloride. Other anti-cancer drugs include, but are not limited to: 20-epi-l,25 
dihydroxyvitaminDS; 5-ethynyluracil; abiraterone; aclarubicin; acylfulvene; adecypenol; 
adozelesin; aldesleukin; ALL-TK antagonists; altretamine; ambamustine; amidox; 
amifostine; aminolevulinic acid; amrubicin; amsacrine; anagrelide; anastrozole; 
andrographolide; angiogenesis inhibitors; antagonist D; antagonist G; antarelix; 
anti-dorsalizing morphogenetic protein-1; antiandrogen, prostatic carcinoma; antiestrogen; 
antineoplaston; antisense oligonucleotides; q)hidicolin glycinate; apoptosis gene 
modulators; apoptosis regulators; apurinic acid; ara-CDP-DL-PTBA; arginine deaminase; 
asulacrine; atamestane; atrimustine; axinastatin 1; axinastatin 2; axinastatin 3; azasetron; 
azatoxin; azatyrosine; baccatin IE derivatives; balanol; batimastat; BCR/ABL antagonists; 
benzochlorins; benzoylstaurosporine; beta lactam derivatives; beta-alethine; betaclamycin 
B; betulinic acid; bFGF inhibitor; bicalutamide; bisantrene; bisaziridinylspermine; 
bisnafide; bistratene A; bizelesin; breflate; bropirimine; budotitane; buthionine sulfoximine; 
calcipotriol; calphostia C; camptothecin derivatives; canarypox IL-2; capecitabine; 
carboxamide-amino-triazole; carboxyamidotriazole; CaRestM3; CARN700; cartilage 
derived inhibitor; carzelesin; casein kinase inhibitors (ICOS); castanospermine; cecropin B; 
cetrorelix; chlorlns; chloroquinoxaline sulfonamide; cicaprost; cis-porphyrin; cladribine; 
clomifene analogues; clotrimazole; coUismycin A; colhsmycin B; combretastatin A4; 
combretastatin analogue; conagenin; crambescidin 816; crisnatol; cryptophycin 8; 
cryptophycin A derivatives; curacin A; cyclopentanthraquinones; cycloplatam; cypemycin; 
cj^tarabiae ocfosfate; cytolytic factor cytostatin; dacliximab; decitabine; dehydrodidemnin 
B; deslorelin; dexamethasone; dexifosfamide; dexrazoxane; dexverapamil; diaziquone; 
didemnin B; didox; diethylnorspermine; dihydro-5-azacytidine; dihydrotaxol, 9-; 
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dioxamycin; diphenyl spiromustine; docetaxel; docosanol; dolasetron; doxifluridine; 

droloxifene; dronabinol; duocarmycin SA; ebselen; ecomustine; edelfosine; edrecolomab; 

eflornithine; elemene; emitefiir; epirubicin; epristeride; estramustine analogue; estrogen 

agonists; estrogen antagonists; etanidazole; etoposide phosphate; exemestane; fadrozole; 
5 fazarabine; fenretinide; filgrastim; finasteride; flavopiridol; flezelastine; fluasterone; 

fludarabine; fluorodaunorunicin hydrochloride; forfenimex; foimestane; fostriecin; 

fotemustine; gadolinium texaphyiin; gallium nitrate; galocitabine; ganirelix; gelatinase 

inhibitors; gemcitabine; glutathione inhibitors; hepsulfam; heregulin; hexamethylene 

bisacetamide; hypericin; ibandronic acid; idarubicin; idoxifene; idramantone; ihnofosine; 
10 ilomastat; imidazoacridones; imiquimod; immunostimulant peptides; insulin-like growth 

factor- 1 receptor inhibitor; interferon agonists; interferons; interleukins; iobenguane; 

iododoxorubicin; ipomeanol, 4-; iroplact; irsogladine; isobengazole; isohomohalicondrin B; 

itasetron; jasplakinolide; kahalalide F; lamellarin-N triacetate; lanreotide; leinamycin; 

lenograstim; lentinan sulfate; leptolstatin; letrozole; leukemia inhibiting factor; leukocyte 
15 alpha interferon; leuproUde+estrogen4progesterone; leuprorelin; levamisole; harozole; 

linear polyamine analogue; lipophilic disaccharide peptide; lipophiUc platinum compounds; 

lissoclinamide 7; lobaplatin; lombricine; lometrexol; lonidamine; losoxantrone; lovastatin; 

loxoribine; lurtotecan; lutetium texaphyrin; lysofylline; lytic peptides; maitansine; 

mannostatin A; marimastat; masoprocol; maspin; matrilysin inhibitors; matrix 
20 metalloproteinase inhibitors; menogaril; m^barone; meterelin; methioninase; 

metoclopramide; MIF inhibitor; mifepristone; miltefosine; mirimostim; mismatched double 

stranded RNA; mitoguazone; mitolactol; mitomycin analogues; mitonafide; mitotoxin 

fibroblast growth factor-saporin; mitoxantrone; mofarotene; molgramostim; monoclonal 

antibody, human chorionic gonadotrophin; monophosphoryl lipid A+myobacterium cell 
25 wall sk; mopidamol; multiple drug resistance gene inhibitor; multiple tumor suppressor 

1-based therapy; mustard anticancer agent; mycaperoxide B; mycobacterial cell wall 

extract; myriaporone; N-acetyldinaline; N-substituted benzamides; nafarelin; nagrestip; 

naloxone+pentazocine; napavin; naphterpin; nartograstim; nedq)latin; nemorubicin; 

neridronic acid; neutral endopeptidase; nilutamide; nisamycin; nitric oxide modulators; 
30 nitroxide antioxidant; nitrullyn; 06-benzylguanine; octreotide; okicenone; oligonucleotides; 

onapristone; ondansetron; ondansetron; oracua; oral cytokine inducer; orm^latin; 

osaterone; oxaliplatin; oxaunomycin; paclitaxel; paclitaxel analogues; paclitaxel 

derivatives; palauamine; palmitoylrhizoxin; pamidronic acid; panaxytriol; panomifene; 

parabactin; pazelliptine; pegaspargase; peldesine; pentosan polysulfate sodium; pentostatin; 
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pentrozole; perflubron; perfosfamide; periUyl alcohol; phenazinomycin; phenylacetate; 
phosphatase inhibitors; picibanil; pilocarpine hydrochloride; pirarubicin; puitrexim; 
placetin A; placetin B; plasntiinogen activator inhibitor; platinum complex; platinum 
compounds; platinum-triamine complex; porjBmer sodium; poijSromycin; prednisone; 
5 propyl bis-acridone; prostaglandin J2; proteasome inhibitors; protein A-based immune 
modulator; protein kinase C inhibitor; protein kinase C inhibitors, microalgal; protein 
tyrosine phosphatase inhibitors; purine.nucleoside phosphorylase inhibitors; purpurins; 
pyrazoloacridine; pyridoxylated hemoglobin polyoxyethylene conjugate; raf antagonists; 
raltitrexed; ramosetron; ras famesyl protein transferase inhibitors; ras inhibitors; ras-GAP 

10 inhibitor; retelliptine demefhylated; rhenium Re 186 etidronate; rhizoxin; ribozymes; RE 
retinamide; rogletimide; rohitukine; romurtide; roquinimex; rubiginone Bl; mboxyl; 
safingol; saintopin; SarCNU; sarcophytol A; sargramostim; Sdi 1 mimetics; semustine; 
senescence derived inhibitor 1; sense oUgonucleotides; signal transduction inhibitors; signal 
transduction modulators; single chain antigen binding protein; sizofiran; sobuzoxane; 

15 sodium borocaptate; sodium phenylacetate; solverol; somatomedin binding protein; 
sonennin; sparfosic acid; spicamycinD; spiromustine; splenopentin; spongistatin 1; 
squalamine; stem cell inhibitor, stem-cell division inhibitors; stipiamide; stromelysin 
inhibitors; suljSnosine; superactive vasoactive mtestinal peptide antagonist; suradista; 
suramin; swainsonine; synthetic glycosaminoglycans; tallimustine; S-fluoroiiracil; 

20 leucovorin; tamoxifen methiodide; tauromustine; tazarotene; tecogalan sodium; tegafur; 
tellurapyrylium; telomerase inhibitors; temoporfin; temozolomide; teniposide; 
tetrachlorodecaoxide; tetrazomine; thaliblastine; fliiocoraline; thrombopoietin; 
thrombopoietin mimetic; thymalfasin; thymopoietin receptor agonist; thymotrinan; thyroid 
stimulating hormone; tin ethyl etiopurpurin; tirapazamine; titanocene bichloride; topsentin; 

25 toremifene; totipotent stem cell factor; translation inhibitors; tretinoin; triacetyluridine; 
triciribine; trimetrexate; triptorelin; tropisetron; turosteride; tyrosine kinase inhibitors; 
tyrphostins; UBC inhibitors; ubenimex; urogenital sinus-derived growth inhibitory factor, 
urokinase receptor antagonists; vapreotide; variolin B; vector system, er3lhrocyte gene 
therapy; thalidomide; velaresol; veramine; verdins; verteporfin; vinorelbine; vinxaltitne; 

30 vorozole; zanot^one; zeniplatia; zilascorb; and zinostatin stimalamer. 

Specific examples of propylactic or ther^eutic agents which can be used in 
accordance with the methods of the invention to prevent, treat, manage and/or ameliorate a 
central nervous system disorders associated with, characterized by or caused by premature 
translation termination and/or nonsense-mediated mRNA decay, or a symptom tiiereof 
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include, but are not limited to: Levodopa, L-DOPA, cocaine, a-methyl-tyrosine, reserpine, 
tetrabenazine, benzotropine, pargyline, fenodolpam mesylate, cabergoline, pramipexole 
dihydrochloride, ropinorole, amantadine hydrochloride, selegiline hydrochloride, carbidopa, 
pergolide mesylate, Sinemet CR, or Symmetrel. 
S Specific examples of propylactic or therapeutic agents which can be used in 

accordance with the methods of the invention to prevent, treat, manage and/or ameHorate a 
metabolic disorders associated with, characterized by or caused by premature translation 
termination and/or nonsense-mediated mRNA decay, or a synq>tom thereof include, but are 
not limited to: a monoamine oxidase mhibitor (MAO), for example, but not limited to, 

10 iproniazid, clorgyline, phenelzine and isocarboxazid; an acetylcholinesterase inhibitor, for 
example, but not limited to, physostigmine saliclate, physostigmine sulfate, physostigmine 
bromide, meostigmine bromide, neostigmine methylsulfate, amberionim chloride, 
edrophonium chloride, tacrine, pralidoxime chloride, obidoxime chloride, trimedoxime 
bromide, diacetyl monoxim, endrophonium, pyridostigmine, and demecarium; an anti- 

15 inflammatory agent, including, but not limited to, naproxen sodium, diclofenac sodium, 
diclofenac potassium, celecoxib, sulindac, oxaprozin, diflunisal, etodolac, meloxicam, 
ibuprofen, ketoprofen, nabumetone, refecoxib, methotrexate, leflunomide, sul&salazine, 
gold salts, RHo-D Immune Globulin, mycophenylate mofetil, cyclosporine, azathioprine, 
tacrolimus, basiliximab, daclizumab, salicylic acid, acetylsalicylic acid, methyl salicylate, 

20 diflunisal, salsalate, olsalazine, sulfasalazine, acetaxninophen, indomethacin, sulindac, 
mefenamic acid, meclofenamate sodium, totmetin, ketorolac, dichlofenac, flurbinprofen, 
oxaprozin, piroxicam, meloxicam, ampiroxicam, droxicam, pivoxicam, tenoxicam, 
phenylbutazone, oxyphenbutazone, antipyrine, aminopyrine, apazone, zileuton, 
aurothioglucose, gold sodium thiomalate, auranofin, methotrexate, colchicine, allopurinol, 

25 probenecid, sulfinpyrazone and benzbromarone or betamethasone and other 

glucocorticoids; an antiemetic agent, for example, but not limited to, metoclopromide, 
domperidone, prochlorperazine, promethazine, chlorpromazine, trimethobenzamide, 
ondansetron, granisetron, hydroxyzine, acetylleucine monoethanolamine, alizapride, 
azasetron, benzquinamide, bietanautine, bromopride, buclizine, clebopride, cyclizine, 

30 dimenhydrinate, diphenidol, dolasetron, meclizine, methallatal, metopimazine, nabilone, 
oxypemdyl, pipamazine, scopolamine, sulpiride, tetrahydrocannabinol, thiethylperazine, 
thioproperazine, tropisetron, and mixtures thereof. 

5.12 Compounds and Methods of Administering Compounds 
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Biologically active compounds identified using the methods of the invention 
or a phannaceutically acceptable salt thereof can be administered to a patient, preferably a 
mammal, more preferably a human, suffering from a disorder (in particular, a disorder 
associated with, characterized by or caused by premature translation tennrnation and/or 
5 nonsense mediated mRNA decay). In a specific embodiment, a compound or a 

pharmaceutically acceptable salt thereof is administered to a patient, preferably a m ainmal^ 
more preferably a human, as a preventative measure against a disorder (in particular, a 
disorder associated with, characterized by or caused by premature translation termination 
and/or nonsense mediated mRNA decay). 

10 In one embodiment, the compound or a pharmaceutically acceptable salt 

thereof is administered as a preventative measure to a patient. According to this 
embodiment, the patient can have a genetic predisposition to a disease, such as a family 
history of the disease, or a non-genetic predisposition to the disease. Accordingly, the 
compound and phannaceutically acceptable salts thereof can be used for the treatment of 

1 5 one manifestation of a disease and prevention of another. 

A compound identified ia accordance with the invention, or a 
pharmaceutically acceptable salt thereof, maybe a component of a composition optionally 
comprising a carrier, diluent or excipient. When administered to a patient, the compound or 
a phannaceutically acceptable salt thereof is preferably administered as component of a 

20 composition that optionaUy comprises a phannaceutically acceptable vehicle. The 

composition can be administered orally, or by any other convenient route, for example, by 
infusion or bolus injection, by absorption through epithelial or mucocutaneous linings (e.g., 
oral mucosa, rectal, and intestinal mucosa, etc.) and may be administered together with 
another biologically active agent. Administration can be systemic or local. Various 

25 delivery systems are known, e,g,, encapsulation in hposomes, microparticles, 
microcapsules, capsules, etc., and can be used to administer the compound and 
phannaceutically acceptable salts thereof. 

Methods of administration include but are not limited to intradermal, 
intramuscular, intraperitoneal, intravenous, subcutaneous, intranasal, epidural, oral, 

30 sublingual, intranasal, intracerebral, intravaginal, transdermal, rectally, by inhalation, or 
topically, particularly to the ears, nose, eyes, or skin. The mode of administration is left to 
the discretion of the practitioner. In most instances, adnodnistration will result in tiie release 
of the compound or a pharmaceutically acceptable salt thereof into the bloodstream. 
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In specific embodiments, it may be desirable to administer the compound or 
apharmaceutically acceptable salt thereof locally. This may be achieved, for example, and 
not by way of limitation, by local infusion during surgery, topical application, e,g,, in 
conjunction with a wound dressing after surgery, by injection, by means of a catheter, by 
5 means of a suppository, or by means of an implant, said implant being of a porous, non- 
porous, or gelatinous material, including membranes, such as sialastic membranes, or fibers. 

In certain embodiments, it may be desirable to introduce the compound or a 
pharmaceutically acceptable salt thereof into the central nervous system by any suitable 
route, including intraventricular, intrathecal and epidural injection. Intraventricular 

10 injection may be facilitated by an intraventricular catheter, for example, attached to a 
reservoir, such as an Ommaya reservoir. 

Pulmonary administration can also be employed, e,g.y by use of an inhaler or 
nebulizer, and formulation with an aerosohzing agent, or via perfusion in a fluorocarbon or 
synttietic pulmonary surfactant. In certain embodiments, the compound and 

15 pharmaceutically acceptable salts thereof can be formulated as a suppository, with 
traditional binders and vehicles such as triglycerides. 

In ano&er embodiment, the compound and pharmaceutically acceptable salts 
thereof can be deUvered in a vesicle, in particular a liposome (see Langer, 1990, Science 
249:1527-1533; Treat et al., in Liposomes in the Ther^y of Lifectious Disease and Cancer, 

20 Lopez-Berestein and Fidler (eds.), Liss, New York, pp. 353-365 (1989); Lopez-Berestein, 
ibid., pp. 317-327; see generally ibid.). 

Jn yet another embodiment, the compound and pharmaceutically acceptable 
salts thereof can be delivered in a controlled release system (see, e.g., Goodson, in Medical 
Applications of Controlled Release, supra, vol. 2, pp. 115-138 (1984)). Other controUed- 

25 release systems discussed in the review by Langer, 1990, Science 249:1527-1533 maybe 
used. In one embodiment, a pump may be used (see Laager, supra; Seflon, 1987, CRC 
Crit. Ref Biomed. Eng. 14:201; Buchwald et al., 1980, Surgery 88:507; Saudek et al, 1989, 
N. Engl. J. Med. 321:574). In another embodiment, polymeric materials can be used (see 
Medical Applications of Controlled Release, Langer and Wise (eds.), CRC Pres., Boca 

30 Raton, Florida (1974); Controlled Drug Bioavailability, Drug Product Design and 

Performance, Smolen and Ball (eds.), Wiley, New York (1984); Ranger and Peppas, 1983, 
J. Macromol. Sci. Rev. Macromol. Chem. 23:61; see also Levy et al., 1985, Science 
228:190; During et al., 1989, Ann. Neurol. 25:351; Howard et al., 1989, J. Neurosurg. 
71:105). In yet another embodiment, a controlled-release system can be placed in proximity 

129 




wo 2004/010106 PCT/US2003/023075 
of a targbt RNA of the compound or a phannaceutically acceptable salt thereof thus 
requiring only a fraction of the systemic dose. 

Compositions comprising the compound or a pharmaceutically a&ceptable 
salt thereof ("compound compositions") can additionally comprise a suitable amount of a 
5 pharmaceutically acceptable vehicle so as to provide the form for proper admdnistration to 
the patient 

In a specific embodiment, the term "pharmaceutically acceptable" means 
approved by a regulatory agency of the Federal or a state government or listed in the U.S. 
Pharmacopeia or other generally recognized pharmacopeia for use in animals, mammals, 

10 and more particularly in humans. The term *Vehicle" refers to a diluent, adjuvant, 
excipient, or carrier with which a compound of the invention is administered. Such 
pharmaceutical vehicles can be liquids, such as water and oils, including those of 
petroleum, animal, vegetable or synthetic origin, such as peanut oil, soybean oil, mineral 
oil, sesame oil and the like. The pharmaceutical vehicles can be saline, gum acacia, gelatin, 

15 starch paste, talc, keratin, colloidal silica, urea, and the like. In addition, auxiliary, 

stabilizing, thickening, lubricating and coloring agents may be used. When administered to 
a patient, the pharmaceutically acceptable vehicles are preferably sterile. Water is a 
preferred vehicle when the compound of the invention is administered intravenously. 
Saline solutions and aqueous dextrose and glycerol solutions can also be employed as liquid 

20 vehicles, particularly for injectable solutions. Suitable pharmaceutical vehicles also include 
excipients such as starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica 
gel, sodium stearate, glycerol monostearate, talc, sodium chloride, dried skim nulk, 
glycerol, propylene, glycol, water, ethanol and the like. Compound compositions, if 
desired, can also contain minor amounts of wetting or emulsifying agents, or pH buffering 

25 agents. 

Compound compositions can take the form of solutions, suspensions, 
emulsion, tablets, pills, pellets, capsides, capsules containing liquids, powders, sustained- 
release formulations, suppositories, emulsions, aerosols, sprays, suspensions, or any other 
form suitable for use. In one embodiment, the pharmaceutically acceptable vehicle is a 
30 capsule (see eg., U.S. Patent No. 5,698,155). Other examples of suitable pharmaceutical 
vehicles are described in Remington's Pharmaceutical Sciences, Alfonso R. Gennaro, ed.. 
Mack Publishing Co. Easton, PA, 19th ed., 1995, pp. 1447 to 1676, incorporated herein by 
reference. 
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In a preferred embodiment, the compound or a phannaceutically acceptable 
salt thereof is formulated in accordance with routine procedures as a pharmaceutical 
composition adapted for oral administration to human beings. Compositions for oral 
delivery may be in the form of tablets, lozenges, aqueous or oily suspensions, granules, 
5 powders, emulsions, capsules, syrups, or elixirs, for example. Orally administered 
compositions may contain one or more agents, for example, sweetening agents such as 
fructose, aspartame or sacchariiy flavoring agents such as peppermint, oil of wintergreen, or 
cherry; coloring agents; and preserving agents, to provide a phannaceutically palatable 
preparation. Moreover, where in tablet or pill form, the compositions can be coated to 

10 delay disintegration and absorption in the gastrointestinal tract thereby providing a 
sustained action over an extended period of time. Selectively permeable membranes 
surroxmding an osmotically active driving compound are also suitable for orally 
administered compositions. In these later platforms, fluid from the enviromnent 
surrounding the capsule is imbibed by the driving compound, which swells to displace the 

15 agent or agent composition through an aperture. These delivery platforms can provide an 
essentially zero order delivery profile as opposed to the spiked profiles of immediate release 
formulations. A time delay material such as glycerol monostearate or glycerol stearate may 
also be used. Oral compositions can include standard vehicles such as mannitol, lactose, 
starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the 

20 like. Such vehicles are preferably of pharmaceutical grade. Typically, compositions for 
intravenous administration comprise sterile isotonic aqueous bufiEer. Where necessary, the 
compositions may also include a solubilizing agent. 

In another embodiment, the compound or a phannaceutically acceptable salt 
thereof can be formulated for intravenous administration. Compositions for intravenous 

25 administration may optionally include a local anesthetic such as lignocaine to lessen pain at 
the site of the injection. Generally, the ingredients are supplied either separately or mixed 
together in unit dosage form, for example, as a dry lyophilized powder or water-free 
concentrate in a hermetically sealed container such as an ampoule or sachette indicating the 
quantity of active agent. Where the compound or a phannaceutically acceptable salt thereof 

30 is to be administered by infusion, it can be dispensed, for example, with an infiision bottle 
containing sterile pharmaceutical grade water or saline. Where the compound or a 
phannaceutically acceptable salt thereof is administered by injection, an ampoule of sterile 
water for injection or saline can be provided so that the ingredients may be mixed prior to 
administration. 
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The amount of a compound or a pharmaceutically acceptable salt thereof that 
will be effective in the treatment of a particular disease will depend on the nature of the 
disease, and can be determined by standard clinical techniques. In addition, in vitro or in 
vivo assays may optionally be employed to help identify optimal dosage ranges. The 
5 precise dose to be employed will also depend on the route of administration, and the 
seriousness of the disease, and should be decided according to the judgment of the 
practitioner and each patient's circumstances. However, suitable dosage ranges for oral 
administration are generally about 0.001 milligram to about 500 milligrams of a compound 
or a pharmaceutically acceptable salt thereof per kilogram body weight per day. In specific 

10 preferred embodiments of the invention, the oral dose is about 0.01 milligram to about 100 
milligrams per kilogram body weight per day, more preferably about 0.1 milligram to about 
75 milligrams per kilogram body weight per day, more preferably about 0.5 milUgram to 5 
milligrams per kilogram body weight per day. The dosage amounts described herein refer 
to total amounts administered; that is, if more than one compound is administered, or if a 

15 compound is administered with a therapeutic agent, then the preferred dosages correspond 
to the total amount administered. Oral compositions preferably contaia about 10% to about 
95% active ingredient by weight 

Suitable dosage ranges for intravenous (i.v.) administration are about 0.01 
noiUigram to about 100 milligrams per kilogram body weight per day, about 0.1 milligram 

20 to about 35 milligrams per kilogram body weight per day, and about 1 milligram to about 
10 milligrams per kilogram body weight per day. Suitable dosage ranges for intranasal 
administration are generally about 0.01 pg/kg body weight per day to about 1 mg/kg body 
weight per day. Suppositories generally contain about 0.01 miUigram to about 50 
milligrams of a compound of the invention per kilogram body weight per day and comprise 

25 active ingredient in the range of about 0.5% to about 10% by weight. 

Recommended dosages for intradermal, intramuscular, intraperitoneal, 
subcutaneous, epidural, sublingual, intracerebral, intravaginal, transdermal administration 
or administration by inhalation are in the range of about 0.001 milligram to about 200 
milligrams per kilogram of body weight per day. Suitable doses for topical administration 

30 are in the range of about 0.001 milligram to about 1 milligram, depending on the area of 
adnunistration. Effective doses may be extrapolated from dose-response curves derived 
from in vitro or animal model test systems. Such animal models and systems are well 
known in the art. 
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The compound and pharmaceutically acceptable salts thereof are preferably 
assayed in vitro and in vivo, for the desired therapeutic or prophylactic activity, prior to use 
in humans. For example, in vitro assays can be used to determine whether it is preferable to 
administer the compound, a pharmaceutically acceptable salt thereof, and/or another 
5 therapeutic agent Animal model systems can be used to demonstrate safety and efficacy. 

6. EXAMPLES 

6.1 EXAMPLE: IDENTmCATION OF A DYE-LABELED TARGET 
RNA BOUND TO SMALL MOLECULAR WEIGHT 
10 COMPOUNDS . 

The results presented in this Example indicate that gel mobility shift assays 
can be used to detect the binding of small molecules, such as the Tat peptide and 
gentamicin, to their respective target RNAs. 

Materials and Methods 

15 Buffers 

Tris-potassium chloride (TK) buffer is composed of 50 mM Tris-HCl pH 
7.4, 20mM KCl, 0.1%Triton X-100, and 0.5mM MgCk. Tris-borate-EDTA (TBE) bufifer is 
composed of 45 mM Tris-borate pH 8.0, and 1 mM EDTA. Tris-Potassium chloride- 
magnesium (TKM) bufifer is composed of 50 mM Tris-HCl pH 7.4, 20mM KCl, 0.1%Triton 
20 X-100 and 5mM MgCk. 

Gel retardation analysis 
RNA oligonucleotides were purchased from Dharmacon, Inc, Lafayette, 
CO). 500 pmole of either a 5* fluorescein labeled oligonucleotide corresponding to the 16S 
rRNA A site (5'-GGCGUCACACCUUCGGGUGAAGUCGCC-3' (SEQ ID NO: 1); 

25 Moazed & NoUer, 1987, Nature 327:389-394; Woodcock et al, 1991, EMBO J. 10:3099- 
3103; Yoshizawa et al, 1998, EMBO J. 17:6437-6448) or a 5' fluorescein labeled 
oligonucleotide corresponding to the HIV-1 TAR element TAR KISfA (5*- 
GGCAGAUCUGAGCCUGGGAGCUCUCUGCC-3' (SEQ ID NO: 2); Huq et al, 1999, 
Nucleic Acids Research. 27:1084-1093; Hwang et al, 1999, Proc. Natl. Acad. Sci. USA 

30 96:12997-13002) was 3' labeled with 5'-^^? cytidine 3', 5'-bis(phosphate) (NEN) and T4 
RNA Bgase (NEBiolabs) in 10% DMSO as per manufacturer's instructions. The labeled 
oligonucleotides were purified using G-2S Sephadex columns (Boehringer Mannheim). For 
Tat-TAR gel retardation reactions the method of Huq et al (Nucleic Acids Research, 1999, 
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27:1084-1093) was utilized with TK buffer containing O.SmM MgCl2 and a 12-mer Tat 
peptide (YGRKKRRQRRRP (SEQ JD NO: 3; single letter amino acid code). For 16S 
rRNA-gentamicin reactions, the method of Huq et al was used with TKM buffer. In 20 
reaction volumes 50 pmoles of ^^P cytidine-labeled oligonucleotide and either gentamicin 
5 sulfate (Sigma) or the short Tat peptide (Tat47-58) in TK or TKM buffer were heated at 90°C 
for 2 minutes and allow to cool to room temperature (^proximately 24*^C) over 2 hours. 
Then 10 jtd of 30% glycerol was added to each reaction tube and the entire sample was 
loaded onto a TBE non-denaturing polyacrylamide gel and electrophoresed at 1200-1600 
volt-hours at 4°C. The gel was exposed to an intensifying screen and radioactivity was 
1 0 quantitated using a Typhoon phosporimager (Molecular Dynamics). 

Background 

One method used to demonstrate small molecule interactions with natural 
occurring RNA stmctures such as ribosomes is by a method called chemical footprinting or 
toe printing (Moazed & NoUer, 1987, Nature 327:389-394; Woodcock et al, 1991, EMBO 

15 J. 10:3099-3103; Yoshizawa et al, 1998, EMBO J. 17:6437-6448). Here the use of gel 
mobility shift assays to monitor RNA-small molecule interactions are described. This 
approach allows for rapid visualization of small molecule-RNA interactions based on the 
difference between mobility of KNA alone versus RNA in a complex with a small 
molecule. To validate this approach, an RNA oligonucleotide corresponding to the well- 

20 characterized gentamicin binding site on the 16S rRNA (Moazed & NoUer, 1987, Nature 
327:389-394) and the equally well-characterized HIV-1 TAT protein bmding site on the 
HIV-1 TAR element (Huq et al, 1999, Nucleic Acids Res. 27: 1084-1093) were chosen. 
The purpose of these experiments is to lay the groundwork for the use of chromatographic 
techniques in a high througlq>ut fashion, such as microc^illary electrophoresis, for drug 

25 discovery. 

Results 

A gel retardation assay was performed using the Tatay-ss peptide and the 
TAR RNA oligonucleotide. As shown in Figure 2, in the presence of the Tat peptide, a 
clear shift is visible when the products are separated on a 12% non-denaturing 
30 polyacrylamide gel. In the reaction that lacks peptide, only the free RNA is visible. These 
observations confirm previous reports made using other Tat peptides (Hamy et al, 1997, 
Proc. Natl. Acad. Sci. USA 94:3548-3553; Huq et al, 1999, Nucleic Acids Res. 27: 1084- 
1093). 
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Based on the results of Figure 2, it was hypothesized that RNA interactions 
with small organic molecules coiild also be visualized using this method. As shown in 
Figure 3, the addition of varying concentrations of gentamicin to an RNA oligonucleotide 
corresponding to the 16S rRNA A site produces a mobiUty shift. These results demonstrate 
S that the binding of the snl^ molecule gentamicin to an RNA oligonucleotide having a 
defined structure in solution can be monitored using this approach. In addition, as shown in 
Figure 3, a concentration as low as IQng/ml gentamicin produces the mobiUty shift 
To determine whether lower concentrations of gentamicin would be 
sufficient to produce a gel shift, a similar exp^iment was performed, as shown in Figure 3, 
10 except that the concentrations of gentamicin ranged from 100 ng/ml to 10 pg/ml. As shown 
in Figure 4, gel mobility shifts are produced when the gentamicin concentration is as low as 
10 pg/ml. Further, the results shown in Figure 4 demonstrate that the shift is specific to the 
16S rRNA oligonucleotide as the use of an unrelated oUgonucleotide, corresponding to the 
HIV TAR RNA element, does not result in a gel mobility shift when incubated with 10 
15 mg/ml gentamicin. In addition, if a concentration as low as 10 pg/ml gentamicin produces a 
gel mobility shift then it should be possible to detect changes to RNA structural motife 
when small amounts of compound from a. library of diverse compounds is screened in this 
fashion. 

Further analysis of the gentamicm-KNA interaction indicates that the interaction is Mg- and 
20 temperature dependent. As shown in Figure 5, when MgCfe is not present (TK buffer), 
Img/ml of gentamicin must be added to the reaction to produce a gel shift. Similarly, Ihe 
temperature of the reaction when gentamicin is added is also in:q)ortant. When gentamicin 
is present in the reaction during the entire denaturation/renaturation cycle, that is, when 
gentamicin is added at 90C°C or 85°C, a gel shift is visualized (data not shown). In 
25 contrast, when gentamicin is added after the renaturation step has proceeded to 75°C, a 
mobility shift is not produced. These results are consistent with the notion that gentamicin 
may recognize and interact with an RNA structure formed early in the renaturation process. 

6.2 EXAMPLE: IDENTIFICATION OF A DYE-LABELED TARGET 
RNA BOUND TO SMALL MOLECULAR WEIGHT 
30 COMPOUNDS BY CAPILLARY ELECTROPHORESIS 

The results presented in this Example indicate that interactions between a 
peptide and its target RNA, such as the Tat peptide and TAR RNA, can be monitored by gel 
retardation assays in an automated capillary electrophoresis system. 

Materials and Methods 
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Buffers 

Tris-potassiiun chloride (TK) buffer is composed of 50 mM Tris-HCl pH 
7.4, 20mM KCl, 0.1%Triton X-100, and 0.5mM MgCh. Tris-boiate-EDTA (TBE) buffer is 
composed of 45 mM Tris-borate pH 8.0, and 1 mM EDTA. Tris-Potassium chloride- 
5 magnesium (TKM) buffer is composed of 50 mM Tris-HCl pH 7.4, 20mM KCl, 0.1%Triton 
X-100 and 5mMMgCl2. 



CO). 500 pmole of a 5' fluorescein labeled oligonucleotide corresponding to the HIV-1 
10 TAR element TARRNA (5'-GGCAGAUCUGAGCCUGGGAGCUCUCUGCC-3' (SEQ 
ID NO: 2); Huq et aL, 1999, Nucleic Acids Research. 27:1084-1093; Hwang ei al, 1999, 
Proc. Natl. Acad. Sci. USA 96:12997-13002) was used For Tat-TAR gel retardation 
reactions the method of Huq et al (Nucleic Acids Research, 1999, 27:1084-1093) was 
utilized with TK buffer containing 0.5mM MgCk and a 12-mer Tat peptide 
15 (YGRKKRRQRRRP (SEQ ID NO: 3); single letter amino acid code). In 20 jxl reaction 
volumes 50 pmoles of labeled oligonucleotide and the short Tat peptide (Tat47-58) in TK or 
TKM buffer were heated at 90°C for 2 minutes and allow to cool to room temperature 
(^proximately 24°C) over 2 hours. The reactions were loaded onto a SCE9610 automated 
capillary electrophoresis apparatus (SpectruMedix; State College, Pemisylvania). 

20 Results 



between a peptide and RNA can be monitored by gel retardation assays. It was 
hypothesized that interactions between a peptide and RNA could be monitored by gel 
retardation assays by an automated capillary electrophoresis system. To test tiiis 

25 hypothesis, a gel retardation assay by an automated capillary electrophoresis system was 
performed using the Tat47.58 pqptide and the TAR RNA oligonucleotide. As shown in 
Figure 6 using the capillary electrophoresis system, in the presence of the Tat peptide, a 
clear shift is visible upon the addition of increasing concentrations of Tat peptide. In the 
reaction that lacks peptide, only a peak corresponding to the free RNA is observed. These 

30 observations confirm previous reports made using other Tat peptides (Hamy et al^ 1997, 
Proc. Natl. Acad. Sci. USA 94:3548-3553; Huq et al, 1999, Nucleic Acids Res. 27: 1084- 
1093). 



Gel Retardation Analysis Using Capillary Electrophoresis 

RNA oligonucleotides were purchased from Dharmacon, Inc, Lafayette, 



As presented in the previous sections of the Example 6.1, interactions 
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6.3 EXAMPLE: COMPOUNDS THAT MODULATE 
TRANSLATION TERMINATION 
BPJD SPECIFIC REGIONS OF 28S RRNA 

Data is presented in this Example that demonstrates that specific regions of 
5 the 28S rRNA are involved in modulating translation tennination in mammalian cells. 

Compounds that interact in these regions or modulate local changes within these regions of 
the ribosome {e,g.^ alter base pairing interactions, base modification or modulate binding of 
trans-acting factors that bind to these regions) have tiie potential to modulate translation 
termination. These regions are conserved firom prokaryotes to eukaryotes, but flie role of 

1 0 these regions in modulating translation termination has not been realized in eukaryotes. In 
bacteria, when a short RNA jfragment, complementary to the E. colt 238 rRNA segment 
comprising nucleotides 735 to 766 (in domain n), is expressed in vivo, suppression of UGA 
nonsense mutations, but not UAA of UAG, results (Chemyaeva et aL, 1999, J Bacteriol 
181 :5257-5262). Other regions of the 23S rRNA in E. coli have been implicated in 

IS nonsense suppression including the GTPase center in domain n (nt 1034-1 120; Jemiolo et 
al, 1995, Proc. Nat. Acad. Sci. 92:12309-12313). 

Materials and Methods 

Small Molecttles Involved in Modulating Translation T ftrminatfoii 

Small molecules involved in modulating translation termination, Le.y 
20 nonsense siqppression, were used in the footprinting experiments presented in Figures 2 to 6 
and are listed as Compoimd A (molecular formula C19H21NO4), Compound B (molecular 
formula C19H21N2O5), Compound C (molecular formula C12H15N5O), Compound D 
(molecular formula C23Hi503Br), Compound E (molecular formula C19H21NO4), Compound 
F, Compound G (molecular formula C12H15N5O), Compound H (molecular formula 
25 C23H15NO5), Compound I (molecular formula C23H15NO5), Compound J, and Compound K. 

Preparation of a Translatioa Extract from HeLa cells 
HeLa S3 cells were grown to a density of 10^ cells/ml in DMEM; 5%C02, 
10%FBS, IX P/S in a spinner flask. Cells were harvested by spinning at lOOOXg. Cells 
were washed twice with phosphate buffered saline. The cell pellet was on ice for 12-24 
30 hours before proceeding. By letting the cells sit on ice, the activity of the extract is 

increased up to twenty-fold. The length of time on ice can range from 0 hours to 1 week. 
The cells were resuspended in 1.5 volumes (packed cell volume) of hypotonic buffer (10 
mM HEPES (KOH) pH 7.4; 15 mM KCl; 1.5 mM Mg(0Ac)2; 0.5 mM Pefabloc (Roche); 2 
naM DTT). The cells were allowed to swell for 5 minutes on ice, dounce homogenized with 
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10 to 100 strokes using a tight-fitting pestle, and spun for 10 minutes at 12000Xg at 4°C in 
a Sorvall SS-34 rotor. The supernatant was collected with a Pasteur pipet without 
disturbing the lipid layer, transferred into Eppendorf tubes (50 to 200 xnl aliquots), and 
immediately frozen in liquid nitrogen. 

5 Footprintmg 

Ribosomes prepared from HeLa cells were incubated with the small 
molecules (at a concentration of 100 /xM), followed by treatment with chemical modifying 
agents (dimethyl sulfate [DMS] and kethoxal [KE]). Following chemical modification, 
rRNA was phenol-chloroform extracted, ethanol precipitated, analyzed in primer extension 

10 reactions using end-labeled oligonucleotides hybridizing to different regions of the rKNAs 
and resolved on 6% polyacrylamide gels. The probes used for primer extension cover the 
entire 18S (7 oligonucleotide primers), 28S (24 oligonucleotide primers), and SS (one 
primer) rKNAs are presented in Table 1 (also see, e.g., Gonzalez et al., 1985 Proc Natl 
Acad Sci USA. 82(22):7666-70 and McCallum & Maden, 1985, Biochem. J. 232 (3): 

1 5 725-733). Controls in these experiments include DMSO (a control for changes in rRNA 
accessibility induced by DMSO), paromomycin (a marker for 18S rRNA binding), and 
anisomycin (a marker for 28S rRNA binding). 



Table 1: 


18S, 28S, and 5S rRNApiimere 






5S#1 


AAAGCCTACAGCACCC 


SEQIDNO.: 


4 


28S#1 


TACTGAGGGAATCCTGG 


SEQIDNO.: 


5 


28S#2 


TTACCACCCGCTTTGGG 


SEQIDNO.: 


6 


28S#3 


GGGGGCGGGAAAGATCC 


SEQIDNO.: 


7 


28S#4 


CCCCGAGCCACCTTCCC 


SEQIDNO.: 


8 


28S#5 


GGCCCCGGGATTCGGCG 


SEQIDNO.: 


9 


28S#6 


CACTGGGGACAGTCCGC 


SEQIDNO.: 


10 


28S#7 


CGCGGCGGGCGAGACGGG 


SEQIDNO.: 


11 


28S#8 


GAGGGAAACTTCGGAGGG 


SEQIDNO.: 


12 


28S#9 


CATCGGGCGCCTTAACCC 


SEQIDNO.: 


13 


28S#10 


CGACGCACACCACACGC 


SEQIDNO.: 


14 


28S#11 


CCAAGATCTGCACCTGC 


SEQIDNO.: 


15 


28S#12 


TTACCGCACTGGACGCC 


SEQIDNO.: 


16 


28S#13 


GCCAGAGGCTGTTCACC 


SEQIDNO.: 


17 


28S#14 


TGGGGAGGGAGCGAGCGGCG 


SEQIDNO.: 


18 


28S#15 


AAGGGCCCGGCTCGCGTCC 


SEQIDNO.: 


19 


28S#16 


AGGGCGGGGGGACGAACCGC 


SEQIDNO.: 


20 


28S#17 


TTAAACAGTCGGATTCCCCTGG 


SEQIDNO.: 21 
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28S#18 


TTCATCCATTCATGCGCG 


SEQ ID NO.: 


22 


28S#19 


AGTAGTGGTATTTCACCGG 


SEQEJNO.: 


23 


28S#20 


ACGGGAGGTTTCTGTCC 


SEQIDNO.: 


24 


28S#21 


ACAATGATAGGAAGAGCCG 


SEQIDNO.: 


25 


28S#22 


AGGCGTTCAGTCATAATCCC 


SEQIDNO.: 


26 


28S#23 


TCCGCACCGGACCCCGGTCC 


SEQIDNO.: 


27 


28S#24 


GGGCTAGTTGATTCGGCAGGTGAGTTG SEQIDNO.: 


28 


18iS#l 


TCTCCGGAATCGAACCCT 


SEQIDNO.: 


29 


18S#2 


ATT ACC GCGGCTGCTGGC 


SEQIDNO.: 


30 


18S#3 


TTGGCAAATGCTTTCGC 


SEQIDNO.: 


31 


18S#4 


CCGTCAATTCCTTTAAGTTTC 


SEQIDNO.: 


32 


18S#S 


AGGGCATCACAGACCTGTTAT 


SEQIDNO.: 


33 


18S#6 


CGACGGGCGGTGTGTAC 


SEQIDNO.: 


34 


18S#7 


CCGCAGGTTCACCTACGG 


SEQIDNO.: 


35 



Results 

The results of these foot-printing experiments (see, e,g.. Figures 7 to 11) indicated 
that the small molecules involved in modulating translation termination alter the 
5 accessihility of the chemical modifying agents to specific nucleotides in the 28S rRNA.. 
More specifically, the regions protected by the small molecules include a conserved region 
in the vicinity of the peptidyl transferase center (domain V, see, e.g.y Figures 7 and 8) 
implicated in peptide bond formation and a conserved region in domaia II (see, e.g., Figures 
9, 10, and 1 1) that may interact with the peptidyl transferase center based on binding of 
10 vemamycin B to both these areas (Vannuffel et al., 1994, Nucleic Acids Res. 
22(21):4449-53). 

6.4 EXAMPLE: HIGH THROUGHPUT IDENTIFICATION OF 
15 COMPOUNDS USING ARRAYS 

To identify molecules of the invention, high througt^jut assaj^ that enable 
each compound to be screened against many different nucleic acids in a parallel manner are 
used. In brief, synthesis beads, with compounds of the invention attached, are distributed 

20 into miCTo titer plates at a density of one bead per well. Compounds of the invention that 
are attached to the beads are then released &om the beads and dissolved in a small amount 
of solvent in each microtiter well. A high precision technique, such as a robotic arrayer, is 
then used to transfer small volumes of solution containing dissolved compounds of the 
invention firom each microtiter well, delivering the compounds to defined locations on glass 

25 slides. The glass slides are derivatized so that the compounds of the invention are 
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immobilized on the surface of the slide. Each compoimd contains a functional group that 
allows for its immobilization on the glass slide. Each slide is then probed with a labeled 
RNA and binding events are detected by, eg., a fluorescence-linked assay that is able to 
detect the label. 



6.5 EXAMPLE: HUMAN DISEASE GENES SORTED BY CHROMOSOME 



Table 2: Genes, Locations and Genetic Disorders on Chromosome 1 



Gene 


GDB Accession ID 


OMIMLink 


ABCA4 


GDB:370748 


MACULAR DEGENERATION, SENILE 
STARGARDT DISEASE 1; STGDl ATP 
BINDING CASSETTE TRANSPORTER; 
ABCR RETINmS PIGMENTOSA-19; 


ABCD3 


GDB:131485 


PEROXISOMAL MEMBRANE 

PPHTPTNT 1 • PVMPl 




\JUD,L Loyjo 


MEDIUM-CHAIN; ACADM 


AGL 


GDB:132644 


GLYCOGEN STORAGE DISEASE IE 


ACT 


GDB:1 18750 


ANGIOTENSIN I; AGT 


ALDH4A1 


GDB:9958827 


HYPERPROLINEMIA, TYPE E 


ALPL 


GDB:1 18730 


PHOSPHATASE, UVER ALKALINE; 
ALPL HYPOPHOSPHATASIA, 
INFANTILE 


AMPDl 


GDB: 119677 


ADENOSINE MONOPHOSPHATE 
DEAMINASE-1; AMPDl 


AP0A2 


GDB:119685 


APOLIPOPROTEIN A-H; AP0A2 


AVSDl 


GDB:265302 


ATRIOVENTRICULAR SEPTAL 
DEFECT; AVSD 


BRCD2 


GDB:9955322 


BREAST CANCER, DUCTAL, 2; BRCD2 


CIQA 


GDB:119042 


COMPLEMENT COMPONENT 1, q 
SUBCOMPONENT, ALPHA 
POLYPEPTIDE; CIQA 


CIQB 


GDB:119043 


COMPLEMENT COMPONENT 1, q 
SUBCOMPONENT, BETA 
POLYPEPTIDE; CIQB 


CIQG 


GDB:128132 


COMPLEMENT COMPONENT 1, q 
SUBCOMPONENT, GAMMA 
POLYPEPTIDE; CIQG 


C8A 


GDB:1 19735 


COMPLEMENT COMPONENT-8, 
DEFICIENCY OF 


C8B 


GDB:119736 


COMPLEMENT COMPONENT-8, 
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Gene 


GDB Accession ID 


OMIMLink 








DEFICIENCY OF, TYPE H 


CACNAIS 


GDB:126431 


CALCIUM CHANNEL, 

ALPHA IS SUBUNTT; CACNAIS 
PERIODIC PARALYSIS I MALIGNANT 
HYPERTHERMIA SUSCEPTIBILITY-5; 

MHS5 


CCV 


GDB:1336655 


CATARACT, CONGENITAL, 
VOLKMANN TYPE; CCV 


CD3Z 


GDB:119766 


CD3Z ANTIGEN, ZETA POLYPEPTIDE; 
CD3Z 


CDC2L1 


GDB:127827 


PROTEIN KINASE p58; PK58 


CHML 


GDB:135222 


CHOROIDEREMIA-LIKE; CHML 


CHSl 


GDB:4568202 


CHEDIAK-HIGASHI SYNDROME; 
CHSl 


CIASl 


GDB:9957338 


COLD HYPERSENSITIVITY 
URTICAIOA, DEAFNESS, AND 
AMYLOIDOSIS 




CLCNKB 


GDB:698472 


CHLORIDE CHANNEL, KIDNEY, B; 
CLCNKB 




CMDIA 


GDB:434478 


CARDIOMYOPATHY, DILATED IA4 
CMDIA 


CMH2 


GDB:137324 


CARDIOMYOPATHY, FAMILIAL 
HYPERTROPHIC, 2; CMH2 


(MM 


GDB:119059 


MELANOMA, MAUGNANT 


COLllAl 


GDB:120595 


COLLAGEN, TYPE XI, ALPHA-1; 
COLllAl 




COL9A2 


GDB: 1383 10 


COLLAGEN, TYPE IX, ALPHA-2 
CHAIN; COL9A2 EPIPHYSEAL 
DYSPLASIA, MULTIPLE, 2; EDM2 




CPT2 


GDB:127272 


MYOPATHY WIIH DEFICIENCY OF 

CARNITINE 

PALMITOYLTRANSFERASE E 
HYPOGLYCEMIA, HYPOKETOTIC, 
WITH DEFICIENCY OF CAKNmNB 
PALMITOYLTRANSFERASE 
CARNITINE 

PALMITOYLTRANSFERASE H; CPT2 


CRBl 


GDB:333930 


RETINITIS PIGMENTOSA-12; RP12 




CSE 


GDB:596182 


CHOREOATHETOSIS/SPASTICrrY, 
EPISODIC; CSE 
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Gene 


GDB Accession ID 


OMIM Link 




CSF3R 


(333:126430 


COLONY STIMULATING FACTOR 3 
RECEPTOR, GRANULOCYTE; CSF3R 




CTPA 


GDB:9863168 


CATARACT, POSTERIOR POLAR. 




CTSK 


GDB:453910 


PYCNODYSOSTOSIS CATHEPSDSTI^ 
CTSK 




DBT 


GDB:118784 


MAPLE SYRUP URINE DISEASE, TYPE 
2 




DIOl 


GDB: 136449 


THYROXINE DEIODINASE TYPE I; 
TXDIl 




DISCI 


GDB:9992707 


DISORDER-2; SCZD2 






DPYD 


GDB:364102 


DIHYDROPYRIMIDINE 
DEHYDROGENASE; DPYD 




EKV 


GDB:1 19106 


ERYTHROKERATODERMIA 
VAEOABILIS; EKV 




ENOl 


GDB:1 19871 


PHOSPHOPYRUVATE HYDRATASE; 
PPH 




ENOIP 


GDB:135006 


PHOSPHOPYRUVATE HYDRATASE; 
PPH 




EPB41 


GDB:119865 


ERYTHROCYTE MEMBRANE 
PROTEIN BAND 4.1; EPB41 
HEREDITARY HEMOLYTIC 




EPHXl 


GDB:1 19876 


EPOXIDE HYDROLASE 1, 
MICROSOMAL; EPHXl 




F13B 


GDB: 119893 


FACTOR Xm, B SUBUNTT; F13B 




F5 


GDB: 119896 


FACTOR V DEFICIENCY 




FCGR2A 


GDB:119903 


Fc FRAGMENT OF IgG, LOW 
AFFINITY na, RECEPTOR FOR; 
FCGR2A 




FCGR2B 


GDB:128183 


Fc FRAGMENT OF IgG, LOW 
APFINITY na, RECEPTOR FOR; . 
FCGR2A 




FCGR3A 


GDB: 119904 


Fc FRAGMENT OF IgG, LOW 
AFFINITY ma, RECEPTOR FOR; 
FCGR3A 




FCHL 


GDB:9837503 


HYPERUPIDEMIA, COMBINED 






FH 


GDB:119133 


FUMARATE HYDRATASE; FH 
LEIOMYOMATA, HEREDITARY 
MULTIPLE, OF SKIN 




EM03 


GDB:135136 


FLAVIN-CONTAINING 
MONOOXYGENASE 3; FM03 
TRIMETHYLAMINURIA 
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Gene 


GDB Accession ID 


OMIM Link 




FM04 


GDB: 127981 


FIAVIN-CONTAINING 
MONOOXYGENASE 2; FM02 


FUCAl 


GDB: 119237 


FUCOSIDOSIS 


FY 


GDB: 11 9242 


BLOOD GROUP—DUFFY SYSTEM; Fy 


GALE 


GDB:119245 


GALACTOSE EPIMERASE 
DEHCffiNCY 


GBA 


GDB:119262 


GAUCHER DISEASE, TYPE I; GD I 




GFND 


GDB:9958222 


GLOMERULAR NEPHRITIS, 
FAMILIAL, Wrra FIBRONECTIN 
DEPOSITS 




GJA8 


GDB:696369. 


CATARACT, ZONULAR 
PULVERULENT 1; CZPl GAP 
JUNCTION PROTEIN, ALPHA-8, 
50-KD; GJA8 


GJ63 


GDB:127820. 


ERYTHROKERATODERMIA 
VARIABILIS; EKV DEAFNESS, 
AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 
2;DFNA2 






GLC3B 


GDB:3801939 


GLAUCOMA 3, PRIMARY INFANTILE, 
B; GLC3B 




HFl 


GDB: 120041 


H FACTOR 1; HFl 


HMGCL 


GDB:138445 


HYDROXYMETHYLGLUTARICACIDU 
RIA; HMGCL 


HPCl 


GDB:5215209 


PROSTATE CANCER4 PRCAl 
PROSTATE CANCER, HEREDITARY 1 


HRD 


GDB:9862254 


HYPOPARATHYROIDISM WITH 
SHORT STATURE, MENTAL 
RETARDATION, AND SEIZURES 


HRPT2 


(336:125253 


HYPERPARATHYROIDISM, FAMILIAL 
PRIMARY, wrra MULTIPLE 
OSSIFYING JAW 


HSD3B2 


GDB: 134044 


ADRENAL HYPERPLASIA E 


HSPG2 


GDB:126372 


HEPARAN SULFATE 
PROTEOGLYCAN OF BASEMENT 
MEMBRANE; HSPG2 MYOTONIC 
MYOPATHY, DWARFISM, 
CHONDRODYSTROPHY, AND 
OCULAR AND FACIAL 


KCNQ4 


GDB:439046 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 
2;DFNA2 
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Gene 


GDB Accession ID 


OMIMLink 






KCS 


GDB:9848740 


KENNY-CAFFEY SYNDROME, 
RECESSIVE FORM 




KIFIB 


GDB:128645 


CHARCOT-MARIE-TOOTH DISEASE, 
NEURONAL TYPE, A; CMT2 A 




LAMBS 


GDB:251820 


LAMININ, BETA 3; LAMB3 




LAMC2 


GDB:136225 


LAMININ, GAMMA 2; LAMC2 
EPIDERMOLYSIS BULLOSA LETALIS 


LGMDIB 


GDB:231606 


MUSCULAR DYSTROPHY, 
LIMB-GIRDLE, TYPE IB; LGMDIB 




GDB:13214o 


LAMDSr A/C; LMNA LIPODYSTROPHY, 
FAMILIAL PARTIAL, DUNNIGAN 
TYPE; LDPl 


LOR 


GDB: 132049 


LORICRIN; LOR 


MCKDl 


GDB:9859381 


POLYCYSTIC KIDNEYS, 
MEDULLARY TYPE 


MCLl 


GDB:139137 


MYELOID CELL LEUKEMIA 1; MCLl 


MPZ 


GDB: 125266 


HYPERTROPHIC NEUROPATHY OF 
DEJERINE-SOTTAS MYELIN PROTEIN 
ZERO; MPZ 


MTHFR 


GDB:370882 


5,10-@METHYLENETETflAHYDROFO 
LATE REDUCTASE; MIHFR 


MTR 


GDB:119440 


METHYLTETRAHYDROFOLATE:L-HO 
MOCYSTEINE 

S-METHYLTRANSFERASE; MTR 


MUTYH 


GDB:9315115 


ADENOMATOUS POLYPOSIS OF THE 
COLON; APC 




MYOC 


GDB:5584221 


GLAUCOMA 1, OPEN ANGLE; GLCIA 
MYOCILIN; MYOC 




NB 


GDB:9958705 


NEUROBLASTOMA; NB 


NCF2 


GDB:120223 


GRANULOMATOUS DISEASE, 
CHRONIC, AUTOSOMAL 
CYTOCHROME-b-POSrriVB FORM 


NEMl 


GDB:127387 


NEMALINE MYOPATHY 1, 
AUTOSOMAL DOMINANT; NEMl 


NPHS2 


GDB:9955617 


ARRHYTHMOGENIC RIGHT 
VENTRICULAR DYSPLASL*^ 
FAMILIAL, 2; ARVD2 




NPPA 


GDB:118727 


NATRIURETIC PEPTIDE PRECURSOR 
A; NPPA 
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Gene 


GDB Accession ID 


OMIMLiiik 




NRAS 


GDB:1 19457 


ONCOGENE NRAS; NRAS; NRASl 




NTRKl 


GDB: 127897 


ONCOGENE TRK. NEUROTROPHIC 
TYROSINE KINASE, RECEPTOR, TYPE 
1 ; NTRKl NEUROPATHY, 
CONGENITAL SENSORY, WITH 
ANHIDROSIS 




0PTA2 


GDB:9955577 


OSTEOPETROSIS, AUTOSOMAL 

DUMJUN AJN 1 , 1 YrxS il; (JPA2 




PBXl 


GDB:125351 


PRE-B-CELL LEUKEMIA 
TRANbCRIFTION FACTOR-1; PBXl 




PCHC 


GDB:9955586 


PHEOCHROMOCYTOMA 




PGD 


GDB: 119486 


6-@PHOSPHOGLUCONATE 
DEHYDROGENASE, ERYTHROCYTE 




PHA2A 


GDB:9955628 


PSEUDOHYPOALDOSTERONISM, 
TYPEn;PHA2 




PHGDH 


GDB:9958261 


3-@PH0SPH0GLYCERATE 
DEHYDROGENASE DEFICIENCY 




PKLR 


GDB: 120294 


PYRUVATE KINASE DEFICIENCY OF 
ERYTHROCYTE 




PKPl 


GDB:4249598 


PLAK0PHILIN1;PKP1 




PLA2G2A 


GDB:120296 


PHOSPHOUPASE A2, GROUP EA; 

PLA2G2A 




PLOD 


GDB:127821 


PROCOLLAGEN-LYSINE, 
2-OXOGLUTARATE 5-DIOXYGENASE; 
PLOD EHLERS-DANLOS SYNDROME, 
TYPE Vt E-D VI; EDS VI 




PPOX 


GDB:1 18852 


PROTOPORPHYRINOGEN OXIDASE; 
PPOX 




PPT 


GDB:125227 


CEROID-LIPOFUSCINOSIS, 
NEURONAL 1, INFANTILE; CLNl 
PALMITOYL-PROTEIN 
THIOESTERASE; PPT 




PRCC 


GDB:3888215 


PAPILLARY RENAL CELL 
CARCINOMA; PRCC 




PRG4 


GDB:9955719 


ARTHROPATHY-CAMPTODACTYLY 

SYNDROME 






PSEN2 


GDB:633044 


ALZHEIMER DISEASE, FAMILIAL, 
TYPE 4; AD4 




PTOSl 


GDB:6279920 


PTOSIS, HEREDITARY CONGENITAL 
1; PTOSl 




REN 


GDB:120345 


RENIN; REN 




RFX5 


GDB:6288464 


REGULATORY FACTOR 5; RFX5 
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Gene 


\^iJM> /vccessioii 111 


iw/rrx/i T Snlr 
iilVlJiVl XjUiK 


RHD 


GDB:1 19551 


RHESUS BLOOD GROUP, D ANTIGEN; 
RHD 


RMDl 


GDB:448902 


RIPPLING MUSCLE DISEASE-1 ; RMDl 


RPE65 


GDB:226519 


RETINAL PIGMENT 
EPITHELIUM-SPECIFIC PROTEIN, 
65-KD; RPE65 AMAUROSIS 


SCCD 


GDB:9955558 


CORNEAL DYSTROPHY, 
CRYSTALLINE, OF SCHNYDER 


SERPINCl 


GDB:1 19024 


ANTITHROMBIN HI DEFICIENCY 


SJSl 


GDB:1381631 


MYOTONIC MYOPATHY, DWARFISM, 
CHONDRODYSTROPHY, AND 
OCULAR AND FACL^L 


SLC19A2 


GDB:9837779 


THIAMINE-RESPONSIVE 
MEGALOBLASTIC ANEMIA 
SYNDROME 


SLC2A1 


GDB:120627 


SOLUTE CARRIER FAMILY 2, 
MEMBER 1; SLC2A1 


SPTAl 


GDB: 119601 


ELLIPTOCYTOSIS, 
RHESUS-UNLINKED TYPE 
HEREDITARY HEMOLYTIC 
SPECTRIN, ALPHA, ERYTHROCYTIC 
1; SPTAl 


TALI 


GDB:120759 


T-CELL ACUTE LYMPHOCYTIC 

LbUJsJiMiA 1; lALl 


TNFSF6 


GDB:422178 


APOPTOSIS ANTIGEN LIGAND 1; 

Aiypi T 
ArlllAjrl 


TNNT2 


GDB:221879 


TROPONIN-T2, CARDIAC; TNNT2 


TPM3 


GDB:127872 


ONCOGENE TRK TROPOMYOSIN 3; 
TPM3 


TSHB 


GDB:120467 


THYROID-STIMULATING HORMONE, 

"DTJ'T* A f^TJf A TXT. TiCTTD 

Bbl A CHAIN; TSHB 


UMrJN. 




UKlDiNJtl MONOFHOorrlArfi KINASE; 
UMPK 


uox 


GDB:127539 


URATE OXIDASE; UOX 


UROD 


GDB:1 19628 


PORPHYIUA CUTANEA TARDA; PCT 


T TQTIO A 




UdilcR iSYNDROMfi, TYPE u; USH2 


VMGLOM 


GDB:9958134 


GLOMUS TUMORS, MULTIPLE 


VWS 


GDB:120532 


CLEFT LIP AND/OR PALATE WITH 
MUCOUS CYSTS OF LOWER LIP 


WS2B 


GDB:407579 


WAARDENBURG SYNDROME, TYPE 
2B;WS2B 
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Table 3: Genes, Locations and Genetic Disorders on Chromosome 2 



Gene 


GDB Accession ID 


Location 


OMIMLink 


ABCBll 


GDB:9864786 


2q24-2q24 
2q24.3-2q24.3 


CHOLESTASIS, PROGRESSIVE 
FAMILIAL INTRAHEPATIC 2; 
PFIC2 


ABCG5 


GDB: 10450298 


2t)21-2p21 


PHYTOSTEROLEMIA 


ABCG8 


GDB:10450300 


2p21-2p21 


PHYTOSTEROLEMIA 


ACADL 


GDB: 118745 


2q34-2q35 


ACYL-CoA 

LONG-CHAIN, DEFICIENCY 
OF 


ACPI 


GDB-118Q62 


2r>25-2n25 


ERYTHROCYTE; ACPI 


AGXT 


GDB:127113 


2q37.3-2q37.3 


OXALOSIS I 


AHHR 


GDB: 118984 


2pter-2q31 


CYTOCHROME P450, 
SUBFAMILY I, POLYPEPTIDE 
1 . CWI A 1 

A, \^ X X X/\i 


ALMSl 


GDB:9865539 


2pl3-2pl2 
2pl3.1-2pl3.1 


ALSTROM SYNDROME 


ALPP 


GDB -119672 


2a37 \-2a%l 1 


AT.KAT TMF PKO^PTTATA W 
PLACENTAL; ALPP 


ALS2 


GDB- 135696 




AMYOTPOPTTrr T ATFP AT 
SCLEROSIS 2, JUVENILE; ALS2 


APOB 


GDB:119686 


2p24-2p23 
2n24-2n24 


APOLIPOPROTEIN B; APOB 


BDE 


GDB:9955730 


2q37-2q37 


BRACHYDACTYLY, TYPE B; 


BDMR 


GDB:533064 


2q37-2q37 


BRACHYDACTYLY-MENTAL 
RETARDATION SYNDROME; 
BDMR 


BJS 


GDB:9955717 


2q34-2q36 


TORTI AND NERVE 
DEAFNESS 


BMPR2 


GDB:642243 


2q33-2q33 
2q33-2q34 


PULMONARY 

HYPERTENSION, PRIMARY; 
PPHl BONE MORPHOGENETIC 
RECEPTOR TYPE H; BMPR2 


CHRNAl 


GDB: 120586 


2q24-2q32 


CHOUNERGIC RECEPTOR, 
NICOTINIC, ALPHA 
POLYPEPTIDE 1; CHRNAl 


CMCAVTD 


GDB:11498919 


2p22.3-2p21 


A > /TTT TAT /^TTT» A'W.TT/^ 

FAMILIAL CHRONIC 
MUCOCUTA^fEOUS, 
DOMINANT TYPE 


CNGA3 


GDB:434398 


2qll.2-2qll.2 


COLORBLINDNESS, TOTAL 

CYCLIC NUCLEOTIDE GATED 
CHANNEL, OLFACTORY, 3; 
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Gene 


GDB Accession ID 


Location 


OMIMLink 








CNG3 


C0L3A1 


GDB: 118729 


2q31-2q32,3 
2q32.2-2q32.2 


COLLAGEN, TYPE IE; C0L3 Al 
EHLERS-DANLOS 
SYNDROME, TYPE IV, 
AUTOSOMAL DOMINANT 


COL4A3 


GDB: 128351 


2q36-2q37 


COLLAGEN, TYPE IV, 
ALPHA-3 CHAIN; COL4A3 


C0L4A4 


GDB:132673 


2q35-2q37 


COLLAGEN, TYPE IV, 
ALPHA-4 CHAIN; COL4A4 


COL6A3 


GDB: 119066 


2q37.3-2q37.3 


COLLAGEN TYPE VI 
ALPHA-3 CHAIN; COL6A3 
MYOPATHY, BENIGN 
CONGENITAL, WITH 
CONTRACTURES 


CPSl 


GDB:119799 


2q33-2q36 
2q34-2q35 
2q35-2q35 


HYPERAMMONEMLA DUE TO 
CARBAMOYLPHOSPHATE 
SYNTHETASE I DEnCIENCY 


CRYGA 


GDB: 119076 


2q33-2q35. 


CRYSTALLDSr, GAMMA A; 
CRYGA 


CRYGEPl 


GDB:119808 


2q33-2q35 


CRYSTALLIN, GAMMA A; 
CRYGA 


CYPIBI 


GDB:353515 


2p21-2p21 
2d22-2d21 
2pter-2qter 


GLAUCOMA 3, PRIMARY 
INFANTILE A: GLC3A 
CYTOCHROME P450, 
SUBFAMILY I 
(DIOXIN-INDUCIBLE), 
POLYPEPTIDE 1; CYPIBI 


CYP27A1 


GDB:128129 


2q33-2qter. 


CEREBROTENDINOUS 
XANTHOMATOSIS 


DBI 


GDB:119837 


2ql2-2q21. 


DIAZEPAM BINDING 
INHIBITOR; DBI 


DES 


GDB:119841 


2q35-2q35 


DESMIN; DES 


DYSF 


GDB:340831 


2p-2p 

2pl3-2pl3 

2pter-2pl2 


MUSCULAR DYSTROPHY 
LIMB-GIRDLE, TYPE 2B; 
LGMD2B MUSCULAR 
DYSTROPHY, LATE-ONSET 
DISTAL 


EDAR 


GDB:9837372 


2qll-2ql3 


DYSPLASIA, HYPOHIDROTIC 
ECTODERMAL DYSPLASIA, 
ANHIDROTIC 


EFEMPl 


GDB:1220111 


2pl6-2pl6 


DOYNE HONEYCOMB 
DEGENERATION OF RETINA 
FIBRILLIN-LIKE; FBNL 


EIF2AK3 


GDB:9956743 


2pl2-2pl2 


EPIPHYSEAL DYSPLASIA, 
MULTIPLE, WTIH 
EARLY-ONSET DL\BETES 
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Gene 


GDB Accession ID 


Location 


OMIMLin]£ 








MELLirUS 


ERCC3 


GDB:119881 


2q21-2q21 


EXCISION-REPAIR, 
COMPLEMENTING 
DEFECTIVE, IN CHINESE 
HAMSTER, 3; ER.CC3 


FSHR 


aDB:127510 


2p21-2pl6 


FOLLICLE-STIMULATING 
HORMONE RECEPTOR; FSHR 
GONADAL DYSGENESIS, XX 
TYPE 


GADl 


GDB: 119244 


2q31-2q31 


PYRIDOXINE DEPENDENCY 
WITH SEIZURES 


GINGF 


Ga)B:9848875 


2p21-2p21 


GINGIVAL SON OF 
SEVENLESS (DROSOPHILA) 
HOMOLOG 1; SOSl 


GLCIB 


GDB:1297553 


2ql-2ql3 


GLAUCOMA 1, OPEN ANGLE, 

B; GLCIB 


GPD2 


GDB:354558 


2q24.1-2q24.1 


GLYCEROL-3-PHOSPHATE 
DEHYDROGENASE-2; GPD2 


GYPC 


GDB: 120027 


2ql4-2q21 


BLOOD GROUP-GERBICH; Ge 








HYDROXYACYT -PnA 
DEHYDROGENASE/3-KETOAC 

YL-CoA 

THIOLASE/ENOYL-CoA 
HYDRATASE, 




mriQ.iAAaKi 




DEHYDROGENASE/3-KETOAC 
YL-CoA 

THIOLASE/ENOYL-CoA 
HYDRATASE, 


H0XD13 


GDB:127225 


2q31-2q31 


HOMEO B0X-D13; H0XD13 
SYNDACTYLY, TYPE H 


HPE2 


GDB: 136066 


2p21-2p21 


MIDLINE CLEFT SYNDROME 


IGKC. 


GDB: 120088 


2pl2-2pl2 
2pll.2-2pll.2 


IMMUNOGLOBULIN KAPPA 
CONSTANT REGION; IGKC 


IHH 


GDB:511203 


2q33-2q35 
2q35-2q35 
2pter-2qter 


BRACHYDACTYLY, TYPE Al; 
BDAl INDIAN HEDGEHOG, 
DROSOPHILA, HOMOLOG OF; 
IHH 


msi 


(©B:133974 


2q36-2q36 


INSULIN RECEPTOR 


ITGA6 


(SDB:128027 


2pter-2qter 


INTEGRBSr, ALPHA-6; ITGA6 


KHK 


GDB:391903 


2p23.3-2p23.2 


FRUCTOSURIA 


KYNU 


GDB:9957925 


2q22.2-2q23.3 




LCT 


GDB:120140 


2q21-2q21 


DISACCHARIDE 
INTOLERANCE n 
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GDB Accession ID 
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OMIMLink 


LHCCSR 


GDB:125260 


2p21-2p21 


LUTEINIZING 

HORMONE/CHORIOGONADOT 
ROPIN RECEPTOR; LHCGR 


LSFC 


GDB:9956219 


2-2 2pl6-2pl6 


CYTOCHROME c OXIDASE 
DEFICIENCY, 

FRENCH-CANADIAN TYPE 


MSH2 


GDB:203983 


2pl6-2pl6 
2p22-2p21 


COLON CANCER, FAMILIAL, 
NONPOLYPOSIS TYPE 1; FCCl 


MSH6 


GDB:632803 


2pl6-2pl6 


G/T MISMATCH-BINDING 
PROTEIN; GTBP 


NEB 


GDB:120224 


2q24.1-2q24.2 


NEBULIN; NEB NEMALINE 
MYOPATHY 2, AUTOSOMAL 
RECESSIVE; NEM2 


NMTC 


GDB:11498336 


2q21-2q21 


THYROID CARCINOMA, 
PAPILLARY 


NPHPl 


GDB:128050 


2ql3-2ql3 


NEPHRONOPHTHISIS, 
FAMILIAL JUVENILE 1; NPHPl 


PAFAHIPI 


GDB:435099 


2pll.2-2pll.2 


PLATELET-ACTIVATING 
FACTOR 

ACETYLHYDROLASE, 
GAMMA SUBUNTT 


PAX3 


GDB:120495 


2q36-2q36 
2q35-2q35 


KLEIN-WAARDENBURG 
SYNDROME WAARDENBURG 
SYNDROME; WSl 


PAX8 


GDB:136447 


2ql2-2ql4 


PAIRED BOX HOMEOTIC 
GENE 8; PAX8 


PMSl 


GDB:386403 


2q31-2q33 


POSTMEIOTIC SEGREGAHON 
INCREASED (S. 
CEREVISIAE)-1;PMS1 


PNKD 


GDB:5583973 


2q33-2q35 


CHOREOATEIETOSIS, 
FAMILIAL PAROXYSMAI4 
FPDl 


PPHl 


GDB:1381541 


2q31-2q32 
2q33-2q33 


PULMONARY 

HYPERTENSION, PRIMARY; 
PPHl 


PROC 


GDB:120317 


2ql3-2q21 
2ql3-2ql4 


PROTEIN C DEFICIENCY, 
CONGENITAL THROMBOTIC 
DISEASE DUE TO 


REGIA 


GDB:132455 


2pl2-2pl2 


REGENERATESfG 
ISLET-DERIVED 1-ALPH>^ 
REGIA 


SAG 


GDB: 120365 


2q37.1-2q37.1 


S-ANTIGEN; SAG 


SFTPB 


GDB:120374 


2pl2-2pll.2 


SURFACTANT-ASSOCLVTED 
PROTEIN, PULMONARY-3; 
SFTP3 
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Gene 


GDB Accession ID 


Location 


OMIMLInk 


SLCllAl 


GDB:371444 


2q35-2q35 


CIRRHOSIS PRIMARY- PBC 

NATURAL 

RESISTANCE-ASSOCIATED 
MACROPHAGE PROTEIN 1; 
NRAMPl 


SLC3A1 


GDB:202968 


2pl6.3-2pl6.3 
2p21-2p21 


SOLUTE CARRIER FAMILY 3, 
MEMBER 1; SLC3A1 
CYSTINURIA; CSNU 


SOSl 


GDB:230004 


2p22-2p21 


GINGIVAL SON OF 
SBVENLESS (DROSOPHILA) 
H0M0L0G1;S0S1 


SPG4 


GDB:230127 


2p24-2p21 


SPASTIC PARAPLEGIA-4, 
AUTOSOMAL DOMINANT; 
SPG4 


SRD5A2 


GDB:127343 


2p23-2p23 


PSEUDOVAGINAL 
PERINEOSCROTAL 
HYPOSPADIAS; PPSH 


TCL4 


GDB:136378 


2q34-2q34. 


T-CELL 

LEUKEMLVLYMPHOMA-4; 

TCL4 


TGFA 


GDB:120435 


2pl3-2pl3 


TRANSFORMING GROWTH 
FACTOR, ALPHA; TGFA 


TMD 


GDB:9837196 


2q31-2q31 . 


TIBIAL MUSCULAR 
DYSTROPHY, TARDIVE 


TPO 


GDB:120446 


2p25-2p25 
2p25-2p24 


THYROID 

HORMONOGENESIS, GENETIC 
DEFECT IN, EA 


UGTl 


GDB:120007 


2q37-2q37 


UDP 

GLUCURONOSYLTRANSFERA 
SE 1 FAMILY, Al; UGTlAl 


UV24 


GDB:9955737 


2pter-2qter 


UV-DAMAGE, EXCISION 
REPAIR OF, UV-24 


WSS 


GDB:9955707 


2q32-2q32 


WRINKLY SKIN SYNDROME; 

WSS 


XDH 


GDB:266386 


2p23-2p22 


XANTHINURIA 


ZAP70 


GDB:433738 


2qll-2ql3 
2ql2-2ql2 


SYK-RELATED TYROSINE 
KINASE; SRK 


ZFHXIB 


GDB:9958310 


2q22-2q22 


DISEASE, MICROCEPHALY, 
AND IRIS COLOBOMA 



Table 4: Genes, Locations and Genetic Disorders on Chromosome 3 



Gene 


GDB Accession ID 


Location 


OMIMLinlc 


ACAAl 


GDB:119643 


3p23-3p22 


PEROXISOMAL 
3-OXOACYL-COENZYME A 
THIOLASE DEFICIENCY 
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Gene 


GDB Accession ID 


Location 


OMIMLink 


AGTRl 


GDB:132359 


3q21-3q25 


ANGIOTENSIN n RECEPTOR, 
VASCULAR TYPE 1; AT2R1 


AHSG 


GDB:1 18985 


3q27-3q27 


ALPHA-2-HS-GLYCOPROTBIN; 

AHSG 


AMT 


GDB:132138 


3p21.3-3p21.2 

J^JL L mZt~D\J£^ 1 . JL 


HYPERGLYCINEMIA, ISOLATED 


ARP 


GDB:9959049 


3p21.1-3p21.1 


ARGININE-KICH PROTEIN 


BBS3 


GDB:376501 


3p-3p 

3pl2,3-3qll.l 


BARDET-BIEDL SYNDROME, 
TYPE 3; BBS3 


BCHE 


GDB:120558 


3q26.1-3q26.2 


BUTYRYLCHOLINESTERASE; 


BCPM 


GDB:433809 


3q21-3q21 


BENIGN CHRONIC PEMPfflGUS; 
BCPM 


BTD 


GDB:309078 


3p25-3p25 


BIOTINIDASE;BTD 


CASR 


GDB:134196 


3q21-3q24 


HYPOCALOURIC 
HYPERCALCEMIA, FAMILIAL; 
HHCl 


CCR2 


GDB:337364 


3p21-3p21 


CHEMOKINE (C-C) RECEPTOR 2; 


CCR5 


GDB:1230510 


3p21-3p21 


CHEMOKINE (C-C) RECEPTOR 5; 
CMKBR5 


CDLl 


GDB:136344 


3q26.3-3q26.3 


DE LANGE SYNDROME; CDL 


CMr2B 


GDB:604021 


3ql3-3q22 


CHARCOT-MARIE-TOOTH 
DISEASE, NEURONAL TYPE, B; 
CMT2B 


C0L7A1 


GDB:128750 


3p21-3p21 

3p21.3-3p21.3 


COLLAGEN, TYPE VH, ALPHA-1; 

C0L7A1 


CP 


GDB:1 19069 


3q23-3q25 
3q21-3q24 


CERULOPLASMIN; CP 


CRV 


GDB:1 1498333 


3p21.3-3p21.1 


VASCULOPATHY, RETINAL, 
WTIH CEREBRAL 
LEUKODYSTROPHY 


CTNNBl 


GDB:141922 


3p22-3p22 
3p21.3-3p21.3 


CATENIN, BETA 1; CTNNBl 


DEM 


GDB:681157 


3pl2-3qll 


DEMENTIA, FAMILIAL 
NONSPECIFIC; DEM 


ETMl 


GDB:9732523 


3ql3-3ql3 


TREMOR, HEREDITARY 
ESSENTIAL 1; ETMl 


FANCD2 


(3)6:698345 


3p25.3-3p25.3 
3nter-3n24 2 


FANCONI PANCYTOPENIA, 
rOMPLFMENTATION GROT TP D 


FIH 


GDB:9955790 


3ql3-3ql3 


HYPOPARATHYROIDISM, 
FAMILIAL ISOLATED; FIH 


F0XL2 


GDB:129025 


3q23-3q23 


BLEPHAROPHIMOSIS, 
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3q22-3q23 


EPICANTHUS INVERSUS, AND 
PTOSIS; BPES 


GBEl 


GDB:138442 


3pl2-3pl2 


GLYCOGEN STORAGE DISEASE 

IV 


GLBl 


GDB: 119987 


3p22-3p21.33 
3p21.33-3p21.33 


GANGLIOSIDOSIS, 
GENERALIZED GMl, TYPE I 


GLCIC 


GDB:3801941 


3q21-3q24 


GLAUCOMA 1, OPEN ANGLE, C; 
GLCIC 


GNAI2 


GDB: 120516 


3p21.3-3p21.2 


GUANINE 

NUCLEOTIDE-BINDING 
PROTEIN, ALPHA-INHIBrriNG, 
POLYPEPTIDE-2; 


GNATl 


GDB- 119277 


3o21 3-3ij21 2 


GUANINE 

NUCLEOTIDE-BINDING 
PROTEIN, 

ALPHA-TRANSDUCING, 

POLYPEPTIDE 


GP9 


GDB:126370 


3pter-3qter 


PLATELET GLYCOPROTEIN IX; 


GPXl 


GDB:1 19282 


3qll-3ql2 
3pZl.3-ip21.o 


GLUTATHIONE PEROXIDASE; 


HGD 


GDB:203935 


3q21-3q23 


ALKAPTONUIOA; AKU 


HRG 


GDB:120055 


3q27-3q27. 


mSTIDINE-RICH 
GLYCOPROTEIN; HRG; HRGP 


mHi 


GDB: 120107 


3p21^-3p21.1 


INTER-ALPHA-TRYPSIN 
INHIBITOR, HEAVY CHAIN-U 
ITIHl;IATIH;rnH 


KNG 


GDB: 125256 


3q27-3q27 


FLAUJEAC FACTOR 

DEFICIENCY 


LPP 


GDB:1391795 


3q27-3q28 


LIM DOMAIN-CONTAINING 
PREFERRED TRANSLOCATION 
PARTNER IN LIPOMA; LPP 


LRSl 


GDB:682448 


3p21.1-3pl4.1 


LARSEN SYNDROME, 
AUTOSOMAL DOMINANT; LRSl 


MCCCl 


GDB: 135989 


3q27-3q27 
3q25-3q27 


BETA-METHYLCROTONYLGLY 
Cn^JUlOAI 


MDSl 


GDB:250411 


3q26-3q26 


MYELODYSPLASIA SYNDROME 
1;MDS1 


MHS4 


GDB:574245 


3ql3.1-3ql3.1 


HYPERTHERMIA 
SUSCEPTTBILrrY-4; MHS4 


MTTF 


GDB:214776 


3pl4.1-3pl2 


MICROPHTHALMIA-ASSOCIATE 
D TRANSCRIPTION FACTOR; 
MITF WAARDENBURG 
SYNDROME, TYPE H; WS2 
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Gene 


GDB Accession ID 


Location 


OAHMLink 


MLHl 


GDB:249617 


3p23-3p22 

3p21.3-3p21.3 


COLON CANCER, FAMILIAL, 

NONPOLYPOSIS TYPE 2; FCC2 


MYL3 


GDB:120218 


3p21.3-3p21.2 


MYOSIN, LIGHT CHAIN, 
SKELETAL SLOW; MYL3 


MYMY 


GDB:11500610 


3p26-3p24.2 


DISEASE 


OPAl 


GDB:118848 


3q28-3q29 


OPTIC ATROPHY 1; OPAl 


PBXPl 


GDB: 125352 


3q22-3q23 


PRE-B-CELL LEUKEMIA 
TRANSCRIPTION FACTOR-1; 

PBXl 


PCCB 


GDB:1 19474 


3q21-3q22 


GLYCINEMIA, KETOTIC, E 


POUIFI 


GDB:129070 


3pll-3pll 


POU DOMAIN, CLASS 1, 
TRANSCRIPTION FACTOR 1; 
POUIFI 


PPARG 


GDB:1223810 


3p25-3p25 


CANCER OF COLON 
PEROXISOME PROLIFERATOR 
ACTIVATED RECEPTOR. 
GAMMA; PPARG 


PROSl 


GDB:120721 


3pll-3qll 
3pll.l-3qll.2 


PROTEIN S, ALPHA; PROSl 


PTEIRl 


GDB:138128 


3p22-3p21.1 


METAPHYSEAL 
CHONDRODYSPLASLV, MURK 
JANSEN TYPE PARATHYROID 
HORMONE RECEPTOR 1; PlHRl 


RCAl 


GDB:230233 


3pl4^-3pl4.2 


RENAL CARCINOMA, 
FAMILIAL, ASSOCL^TED 1; 


RHO 


GDB:120347 


3q21.3-3q24 


RHODOPSIN;RHO 


SCA7 


GDB:454471 


3p21.1-3pl2 


SPINOCEREBELLAR ATAXIA 7; 
SCA7 


SCLCl 


GDB:9955750 


3p23-3p21 


SMALL-CELL CANCER OF THE 

LUNG; SCCL 


SCN5A 


GDB:132152 


3p21-3p21 


SODIUM CHANNEL, 
VOLTAGE-GATED, TYPE V, 
ALPHA POLYPEPTIDE; SCN5 A 


SI 


GDB:120377 


3q25.2-3q26,2 


DIS ACGHARIDE INTOLERANCE 
I 


SLC25A20 


GDB:6503297 


3p21.31-3p21.31 


CARNITINE-ACYLCARNiriNE 
TRANSLOCASE; CACT 


SLC2A2 


GDB:119995 


3q26.2-3q27 
3q26.1-3q26.3 


SOLUTE CARRIER FAMILY 2, 
MEMBER 2; SLC2A2 
FANCONI-BICKEL SYNDROME; 
FBS 


TF 


GDB:120432 


3q21-3q21 


TRANSFERRIN; TF 
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TGFBR2 


GDB:224909 


3p22-3p22 
3pter-3p24.2 


TRANSFORMING GROWTH 
FACTOR-BETA RECEPTOR, 
Lirhll, IGroKz 


THPO 


GDB:374007 


3q26.3-3q27 


THROMBOPOIETIN; THPO 


THRB 


GDB: 120731 


3p24.1-3p22 
3p24.3-3p24.3 


THYROID HORMONE 
RECEPTOR, BETA; THRB 


TKT 


GDB:132402 


3pl4.3-3pl4.3 


WERNICKE-KORSAKOFF 
SYNDROME 


TM4SF1 


GDB:250815 


3q21-3q25 


TUMOR-ASSOCIATED ANTIGEN 
Lo; TAALo 


TRH 


GDB: 128072 


3pter-3qter 


THYROTROPIN-RELEASING 
HORMONE DEFICIENCY 


IMPS 


GDB: 120482 


3ql3-3ql3 


OROTICACIDURIAI 


UQCRCl 


GDB:141850 


3p21.3-3p21.2 
3p21.3-3p21.3 


UBIQUINOL-CYTOCHROME c 
REDUCTASE CORE PROTEIN I; 
UQCRCl 


USH3A 


GDB:392645. 


3q21-3q25 


USHER SYNDROME, TYPE HI; 
USH3 


VHL 


GDB: 120488 


3p26-3p25 


VON HIPPEL-LINDAU 
SYNDROME; VHL 


WS2A 


GDB: 128053 


3pl42-3pl3 


MICROPHTHALMIA-ASSOCIATE 
D TRANSCRIPTION FACTOR; 
MTTF WAARDENBURG 
SYNDROME, TYPE H; WS2 


XPC 


GDB:134769 


3p25.1-3p25.1 


XERODERMA PICMENTOSUM, 
COMPLEMENTATION GROUP C; 
XPC 


ZNF35 


GDB: 120507 


3p21-3p21 


ZINC FINGER PROTEIN-35; 
ZNF35 



Table 5: Genes, Locations and Genetic Disorders on Chromosome 4 



Gene 


GDB Accession ID 


Location 


OMIMLink 


ADHIB 


GDB:1 19651 


4q21-4q23 
4q22-4q22 


ALCOHOL 

DEHYDROGENASE-2; ADH2 


ADHIC 


GDB: 119652 


4q21-4q23 
4q22-4q22 


ALCOHOL 

DEHYDROGENASE-3; ADH3 


AFP 


GDB:1 19660 


4qll-4ql3 


ALPHA-FETOPROTEIN; AFP 


AGA 


GDB:1 18981 


4q23-4q35 
4q32-4q33 


ASPARTYLGLUCOSAMBSrURIA 
;AGU 


AIH2 


GDB:1 18751 


4qll-4ql3 
4ql3.3-4q21.2 


AMELOGENESIS IMPERFECTA 
2, HYPOPLASTIC LOCAL, 
AUTOSOMAL DOMINANT; 
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AT*R 






AT "RTTMTM* ATTi 


ASMD 


GDB: 119705 


4q-4q4q28-4q31 


ANTERIOR SEGMENT OCULAR 
DYSGENESIS; ASOD 


BFEID 


GDB:11498907 


4q34.1-4q35 


DYSPLASIA, BEUKES TYPE 


CNGAl 


GDB: 127557 


4pl4-4ql3 


CYCLIC NUCLEOTIDE GATED 
CHANNEL, PHOTORECEPTOR, 

CvXiVLr VJ/VIJdJL', X, l^iNL/VJl 


CRBM 


GDB:9958132 


4pl6.3-4pl6.3 


CHERUBISM 


DCK 


GDB:126810 


4ql3.3-4q21.1 


DEOXYCYTIDINE KINASE; 
DCK 


DFNA6 


GDB:636175 


4pl6.3-4pl6.3 


DEAFNESS, AUTOSOMAL 
DOMINANT NONS YNDROMIC 
SENSORINEURAL, 6; DFNA6 


DSPP 


GDB:5560457 


4pter-4qter 
iqz 1 . j-'iqz 1 .J 


DENTIN PHOSPHOPROTEIN; 

"HPP ■nin'NTTTXTOmHXnHCTC 
Urr JJJiJN 1 JJNUOJiJNJtiolo 

EvIPERFECTA; DGIl 


DTDP2 


GDB:9955810 


4q-4q 


DENTIN DYSPLASIA, TYPE H 


ELONG 


GDB:1 1498700 


4q24-4q24 




ENAM 


GDB:9955259 


4q21-4q21 


AMELOGENESIS IMPERFECTA 
2, HYPOPLASTIC LOCAL. 
AUTOSOMAL DOMBSTANT; 
AMELOGENESIS IMPERFECTA, 
HYPOPLASTIC TYPE 


ETFDH 


GDB:135992 


4q32-4q35 


GLUTARICACIDURIA EC; GA 


BVC 


GDB:555573 


4pl6-4pl6 


ELLIS-VAN CREVELD 
SYNDROME; EVC 


Fll 


GDB:1 19891 


4q35-4q35 


PTA DEFICIENCY 


FABP2 


GDB:1 19127 


4q28-4q31 


FATTY ACID BINDING 
PROTEIN 2, INTESTINAL; 
FABP2 


FGA 


GDB:1 19129 


4q28-4q28 . 


AMYLOIDOSIS, FAME^IAL 
VISCERAL FIBRIISrOGEN. A 
ALPHA POLYPEPTIDE; FGA 


FGB 


GDB: 119130 


4q28-4q28 


FIBRINOGEN, B BETA 
POLYPEPTIDE; FGB 


FGFR3 


GDB:127526 


4pl6.3-4pl6.3 


ACHONDROPLASIA; ACH 
BLADDER CANCER 
FIBROBLAST GROWTH 
FACTOR RECEPTOR-3; FGFR3 


FGG 


GDB:1 19132 


4q28-4q28 


FIBRINOGEN, G GAMMA 
POLYPEPTIDE; FGG 


FSHMDIA 


GDB:1 19914 


4q35-4q35 


FACIOSCAPULOHUMERAL 
MUSCULAR DYSTROPHY lA; 
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ujyuiVi ijinK 










GC 


GDB:1 19263 


4ql2-4ql3 


GROUP-SPECIFIC 


GNPTA 


GDB:119280 


4q21-4q23 


MUCOLIPIDOSIS H; ML2; ML n 


GNRHR 


GDB:136456 


4ql3-4ql3 


GONADOTROPIN-RELEASING 
HORMONE RECEPTOR; GNRHR 


GYPA 


GDB:1 18890 


4q28-4q31 


BLOOD GROUP-MN LOCUS; 

MJN 




vjuo.yyD'to ij 




WVT>*RT?r^ AT r'TTTDT A T7ATi>nTTAT 

Jtl I Jr JiJv\^/\J-/l^l U JKiA, r AMLLIAL 
IDIOPATHIC 


HCL2 


GDBil 19305 


4q28-4q31 4q-4q 


HAIRCOLOR-2;HCL2 


nu 




*fpl0.o-4piO.:> 


tlUJNllWLrlUJN DiailAuOc; HD 


HTN3 


GDB: 125601 


4ql2-4q21 


HISTAnN-3; HTN3 


HVBS6 


GDB:120687 


4q32-4q32 


HEPATOCELLULAR 
CARCINOMA-2; HCC2 


IDUA 


GDB: 119327 


4pl6.3-4pl6.3 


MUCOPOLYSACCHARIDOSIS 
TYPE I; MPS I 


IF 


C2)B:120077 


4q24-4q25 
4q25-4q25 


COMPLEMENT COMPONENT-3 
INACTIVATOR, DEnCIENCY 
OF 


JPD 


GDB:120113 


4pter-4qter 
4ql2-4ql3 


PERIODONTITIS, JUVENILE; 
JPD 


KIT 


GDB:120117 


4ql2-4ql2 


V-KTT HARDY-ZUCKERMAN 4 
FELINE SARCOMA VIRAL 
ONCOGENE HOMOLOG; KIT 


KLKBl 


GDB:127575 


4q34-4q35 
4q35-4q35 


FLETCHER FACTOR 
DEFICIENCY 


LQT4 


GDB:682072 


4q25-4q27 


SYNDROME WITHOUT 

PSYCHOMOTOR 

RETARDATION 


MANBA 


GDB:125261 


4q21-4q25 


MANNOSIDOSIS, BETA; 
MANBl 


MLLT2 


GDB:136792 


4q21-4q21 


MYELOID/LYMPHOID OR 
MIXED LINEAGE LEUKEMDV, 
TRANSLOCATED TO, 2; MLLT2 


MSXl 


GDB: 120683 


4pl6.3-4pl6.1 
4pl6.1-4pl6.1 


MSH, DROSOPHILA, HOMEO 
BOX, HOMOLOG OF, 1; MSXl 


MTP 


(H)B:228961 


4q24-4q24 


MICROSOMAL TRIGLYCERIDE 
TRANSFER PROTEIN, 88 KD; 
MTP 


NR3C2 


GDB:120188 


4q31-4q31 
4q31.1-4q31.1 


PSEUDOHYPOALDOSTERONIS 
M, TYPE L AUTOSOMAL 
RECESSIVE; PHAl 
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Gene 


GDB Accession ED 


Location 


OMIMLink 


PBT 


GDB:120260 


4ql2-4q21 


PIEBALD TRAIT; PBT 


PDE6B 


GDB:125915 


4pl6.3-4pl6.3 


NIGHTBLESIDNESS, 

CONGENITAL STATIONARY; 
CSNB3 PHOSPHODIESTERASE 
6B, cGMP-SPECIFIC, ROD, 
BETA;PDE6B 


PEEl 


GDB:7016765. 


4q31-4q34 
4q25-4qter 


1;PEE1 


prrx2 


GDB:134770 


4q25-4q27 
4q25-4q26 
4q25-4q25 


miDOGONrODYSGENESIS, 
TYPE 2; IRID2 RIEGER 
SYNDROME. TYPE 1- KEGl 
RJEG BICOID-RELATED 
HOMEOBOX TRANSCRIPTION 
FACTOR 1; RIEGl HOMEO BOX 
2 


PKD2 


GDB:1 18851 


4q21-4q23 


POLYCYSTIC KIDNEY 
DISEASE 2; PKD2 


QDPR 


GDB:120331 


4pl5.3-4pl5.3 
4pl5.31-4pl5.31 


PHENYLKETONURIA H 


SGCB 


GDB:702072 


4ql2-4ql2 


MUSCULAR DYSTROPHY, 
LIMB-GIRDLE, TYPE 2E; 
LGMD2E 


SLC25A4 


GDB:119680 


4q35-4q35 


ADENINE NUCLEOTIDE 
TRANSLOCATOR 1; ANTl 
PROGRESSIVE EXTERNAL 
OPHTHALMOPLEGL^.; PEO 


SNCA 


GDB:439047 


4q21.3-4q22 
4q21-4q21 


SYNUCLEIN, ALPHA; SNCA 
PARKINSON DISEASE. 
rAMlLlAL, TYPE 1; PARKl 


SODS 


GDB:125291 


4pl6.3-4q21 


SUPEROXIDE DBMUTASE, 

r/XlKA.CELLULAR; SOD3 


JslAlrl 


yjUti.ljiUiyl 


A^t t Ant 1 

4qll-4qiJ 


STATHERIN; STATH; STR 


TAPVRl 


GDB:392646 


4pl3-4qll 


ANOMALOUS PULMONARY 
VrlJNUUb Kbl UKN; ArVR 


TYS 


GDB: 119624 


4q-4q 


SCLEROTYLOSIS; TYS 


WBS2 


GDB:132426 


4q33-4q35.1 


WILLIAMS-BEUREN 
SYNDROME; WBS 


WFSl 


GDB:434294 


4p-4p 4pl6-4pl6 


DIABETES MELLITUS AND 
INSIPIDUS WITH OPTIC 
ATROPHY AND DEAFNESS 


WHCR 


GDB:125355 


4pl6.3-4pl6.3 


WOLF-HIRSCHHORN 
SYNDROME; WHS 
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Gene 


GDB Accession ID 


OMIMLink 


ADAMTS2 


GDB:9957209 


EHLERS-DANLOS SYNDROME, 

TYPE Vn, AUTOSOMAL RECESSIVE 


ADRB2 


GDB:120541 


BETA-2-ADRENERGIC RECEPTOR; 
ADRB2 


AMCN 


GDB:9836823 


ARTHROGRYPOSIS MULTIPLEX 
CONGENITA, NEUROGENIC TYPE 


AP3B1 


GDB:9955590 


HERMANSKY-PUDLAK 
SYNDROME; HPS 


APC 


GDB:1 19682 


ADENOMATOUS POLYPOSIS OF 

'TOT? r^CW r\K^- A TkC^ 


ARbB 


tjDB:iiyuoo 


mxjKJJrKJL, I o AUCJblAjsJULlUolS TYF£ 
VI; MPS VI 


B4GALT7 


CjrDl5:yy57o53 


iSYNDKUMil, JriiUCjilKUiD rORM 


BHRl 


GDB:9956078 


ASTHMA 


C6 


GDB: 11 9045 


COMPLEMENT COMPONENT-6, 

DEFICIENCY OF 


C7 


GDB:119046 


COMPLEMENT COMPONENT-7, 
DEFICIENCY OF 


CCAL2 


GDB:5584265 


CHONDROCALCINOSIS, FAMILIAL 
ARTICULAR 


CKNl 


GDB: 128586 


COCKAYNE SYNDROME, TYPE I; 
CKNl 


CMDJ 


GDB:9595425 


CRANIOMETAPHYSEAL 
DYSPLASIA, JACKSON TYPE; CMDJ 


CRHBP 


GDB:127438 


CORTICOTROPIN RELEASING 
HORMONE-BINDING PROTEIN; 
CRHBP 


CSFIR 


GDB: 120600 


COLONY-STIMULATING FACTOR-1 
RECEPTOR; CSFIR 


DHFR 


GDB: 119845 


DIHYDROFOLATE REDUCTASE; 
DHFR 


DIAPHl 


GDB:9835482 


DEAFNESS. AUTOSOMAL 
DOMINANT NONSYNDROMIC 
SENSORINEURAL, 1; DFNAl 
DIAPHANOUS, DROSOPHILA, 
HOMOLOG OF, 1 


DTR 


GDB:119853 


DIPHTHERIA TOXIN SENSITIVITY; 






■D/^CrKT/^TlTTTr T A "CATiyTTTTAT 


Hiv VJtv 




DEGENERATION OF WAGNER 


F12 


GDB: 119892 


HAGEMAN FACTOR DEFICIENCY 


FBN2 


GDB:128122 


CONTRACTURAL 
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Gene 


GDB Accession ID 


OMIMLink ' 






ARACHNODACTYLY, 
CONGENITAL; CCA 


GDNF 


GDB:450609 


GLIAL CELL LINE-DERIVED 
NEUROTROPHIC FACTOR; GDNF 


GHR 


GDB: 119984 


GROWTH HORMONE RECEPTOR; 
rixro 




yjxJD, 1 looi/i 


yjL, X y^LCsc JKUi^jir l \Jj\, a i .km a- i 
SUBUNIT; GLRAl KOK DISEASE 


VJlVlZ/V 




T A V Q A PWQ "nTQT? A CT7 AT3 A/A'DTA XTT 
l/\ I -Oj^iL^^no I>'lorl//Vorl/, Ax5 V AKJAIN 1 










VJl^JS . J O ^ 7 


1 / -{u^C 1 r\rSji I UlsXJJ^ Loi lilvUll-' 

DEHYDROGENASE IV; HSD17B4 




yJUlS, lZoU3 1 


UN ICvjInJPm, AIjX^xIA-z, IIUAZ 




riFiTl • QQ ^ RO 8 7 
VJl>'13.yy3ol7 0 / 


V liK. 1 IZ/IdJnJvL r U olUJN 


LGMDIA 


GDB: 11 8832 


MUSCULAR DYSmOPHY, 

l^lXVLt5-ljrlKlJI #r#, 1 xlrJi lA; LXjNIUIA 


LOX 


GDB:1 19367 


LYS YL OXIDASE; LOX 


LTC4S 


GDB:3S4080 


LEUKOTRIENE C4 SYNTHASE; 
LTC4S 


MAN2A1 


GDB:136413 


MANNOSIDASE, ALPHA, H; MANA2 
DYSERYTHROPOIETIC ANEML\, 

I^^XmPXTTTAT TVT>"P TT 


MCC 


GDB:128163 


MUTATED IN COLORECTAL 


MCCC2 


GDB:135990 


n 


MSH3 


GDB:6419S6 


MutS, E. COLI, HOMOLOG OF, 3; 
MSH3 


MSX2 


GDB: 138766 


MSH (DROSOPHILA) HOMEO BOX 
HOMOLOG 2; MSX2 PARIETAL 

PO"D AAvTTKTA G'VTV>r\>rDTT>Tr*. 'DTTXiT 


NR3C1 


GDB:120017 


GLUCOCORTICOID RECEPTOR; GRL 


PCSKl 


Ga)B:128033 


PROPROTEIN CONVERTASE 
SUBTILISIN/B[EXIN TYPE 1; PCSKl 


PDE6A 


GDB: 120265 


PHOSPHODIESTERASE 6A, 
cGMP-SPECMC, ROD, ALPHA; 

JrUriOA 


rrDi 




IXNlJlJNoll I \Jr UNrliV-rllUJN UN 


RASAl 


GDB:120339 


RAS p21 PROTEIN ACTIVATOR 1; 


SCZDl 


GDB: 120370 


DISORDER-l; SCZDl 


SDHA 


GDB:378037 


SUCCINATE DEHYDROGENASE 
COMPLEX, SUBUNIT A, 



160 




wo 2004/010106 PCT/US2003/023075 



Gene 


GDB Accession ID 


OMIMLink 






FLAVOPROTEIN; SDHA 


SGCD 


GDB:5886421 


SARCCXjLYCAN, DELTA; SGCD 


SLC22A5 


GDB:9863277 


CARNITINE DEFICIENCY, 
SYSTEMIC, DUE TO DEFECT EST 
RENAL REABSORPTION 


SLC26A2 


GDB: 125421 


DIASTROPfflC DYSPLASIA; DTD 
EPIPHYSEAL DYSPLASIA, 
MULTIPLE; MED NEONATAL 
OSSEOUS DYSPLASIA I 
ACHONDROGENESIS, TYPE IB; 
ACGIB 


SLC6A3 


GDB:132445 


SOLUTE CARRIER FAMILY 6, 
MEMBER 3; SLC6A3 
DEFICIT-HYPERACTIVITY 
DISORDER; ADHD 


SMI 


GDB:9834488 


SCHISTOSOMA MANSONI 
SUSCEPTEBimY/RESISTANCE 


SMA@ 


GDB: 120378 


SPINAL MUSCULAR ATROPHY I; 
SMA I SURVIVAL OF MOTOR 
NEURON 1, TELOMERIC; SMNl 


SMNl 


GDB:5215173 


SPINAL MUSCULAR ATROPHY I; 
SMA I SURVIVAL OF MOTOR 
NEURON 1, TELOMERIC; SMNl 


SMN2 


GDB:5215175 


SPINAL MUSCULAR ATROPHY I; 
SMA I SURVIVAL OF MOTOR 
NEURON 2, CENTROMERIC; SMN2 


SPINK5 


GDB:9956114 


NETHERTON DISEASE 


TCOFl 


GDB:127390 


TREACHER 

COLLINS-FRANCESCHETTI 
SYNDROME 1; TCOFl 


TGFBI 


GDB:597601 


CORNEAL DYSTROPHY, 
GRANULAR TYPE CORNEAL 
DYSTROPHY, LATTICE TYPE I; 
CDLl TRANSFORMING GROWTH 
FACTOR, BETA-INDUCED, 68 KD; 
TGFBI 



Table 7: Genes, Locations and Genetic Disorders on Chromosome 6 



Gene 


GDB Accession ID 


OMIMLink 


ALDH5A1 


GDB:454767 


SUCCINIC SEMLMDEHYDE 
DEHYDROGENASE, 
NAD(+)-DEPENDENT; SSADH 


ARGl 


GDB ill 9006 


ARGININEMIA 
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Gene 


GDB Accession ID 


OMIMLiiik 


A CI 

AS 


GDB: 135097 


AJNJvi LAJollNu' aJfUWDYLlllb; Ab 


ASSP2 


GDB:119017 


CTTRULLINEMIA 


BCKDHB 


GDB:118759 


MAPLE SYRUP. URINE DISEASE, TYPE 
IB 


BF 


GDB: 119726 


GLYCINE-RICH 

iA-LrijYUUrKUiiiJJN; U±>(jr 


C2 


GDB: 119731 


COMPLEMENT COMPONENT-2, 


C4A 


GDB: 119732 


COMPLEMENT COMPONENT 4A; C4A 


CDKNIA 


GDB:266550 


CYCLBSr-DEPENDENT KINASE 
INHIBrrOR lA; CDKNIA 


COLlOAl 


GDB: 128635 


COLLAGEN, TYPE X, ALPHA 1; 
COLlOAl 


COL11A2 


GDB:119788 


COLLAGEN, TYPE XI. ALPHA-2; 
COLl 1 A2 STICKLER SYNDROME, 
TYPE H; STL2 DEAFNESS, 
AUTOSOMAL DOMINAMT 
NONSYNDROMIC SENSORINEURAL, 
13; DFNA13 


CYP21A2 


GDB: 120605 


ADRENAL HYPERPLASL\, 
CONGENITAL, DUE TO 
21-ilYDKUAYLAbb DJirlCiciNCY 


DYX2 


GDB:437584 


DYbLbXlA, brbLirlC, 2; DYXz 


EJMl 


GDB:1 19864 


MYOCLONIC EPILEPSY, JUVENILE; 
EJMl 


EL0VL4 


GDB: 11499609 


STARGARDT DISEASE 3; lbTCjD3 


EPM2A 


GDB:3763331 


EPILEPSY, PROGRESSIVE 
MYOCLONIC 2; EPM2 


ESRl 


GDB: 119120 


ESTROGEN RECEPTOR; ESR 


EYA4 


GDB:700062 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 
10; DFNAIO 


F13A1 


GDB:120614 


FACTOR Xm, Al SUBUNTT; F13A1 


FANCE 


GDB: 1220236 


FANCONIANEML\, 
COMPLEMENTATION GROUP E; 
FACE 


GCLC 


GDB:132915 


GAMMA-GLUTAMYLCYSTEINE 
SYNTHETASE DEHCIENCY, 

HEMOLYTIC AMEMIA DUE 


LrJAl 




vj/Vr J UXN \^ i }\JiS rtsxj X iZrirN, J\xjr xi/\- 1 , 
43 KD; GJAl 


GLYSl 


GDB:136421 


GLYCOSUIOA, RENAL 


GMPR 


GDB:127058 


GUANINE MONOPHOSPHATE 



162 




wo 2004/010106 PCT/US2003/023075 



Gene 


GDB Accession ED 


OMIMLink ~ " " 






REDUCTASE 


GSE 


GDB:9956235 


DISEASE; CD 


HCR 


GDB:9993306 


PSORIASIS, SUSCEPTIBILITY TO 

HFEGDB:119309 
HEMOCHROMATOSIS; HFE 


HLA-A 


GDB: 119310 


HLA-A HISTOCOMPATIBIUTY TYPE; 
HLAA HLA-DPB1GDB:120636 LA-DP 
HISTOCOMPATIBILITY TYPE, BETA-1 
SUBUMT 


HLA-DRA 


GDB: 120641 


HLA-DR HISTOCOMPATIBILITY 
TYPE; HLA-DRA 


HPEH 


GDB:9849006 


HETEROCELLULAR HEREDITARY 
PERSISTENCE OF HSTAL 
HEMOGLOBIN 


ICSl 


GDB:136433 


IMMOTILE CILIA SYNDROME-1; ICSl 


IDDMl 


GDB:9953173 


DIABETES MELLITUS, 
JUVENILE-ONSET 
INoULJttM-DEPENDKNT; JDDM 


IFNGRl 


GDB: 120688 


INTERFERON, GAMMA, RECEPTOR-1; 
IrJNGKl 


ICjADI 


(jrDB:oy2yU77 


CT7T T3/^'iM V7T? T^T?T7T/*^TC?XT/^"Vr /^t? 

bELECilVb DEJi^iClbJNCY Or 


IGrzK 


CrDB:12UUo3 


TKTCTTT TXT T TX^T? r^Xi rWWTTl T? A r^^fWk O 

INbULlN-LiKE GROWTH FACTOR 2 
RECEPTOR; IGF2R 


ISCW 


GDB:995ol58 


OTTOTlT^TJCOT/^XT. TO 

SUPPRESSION; IS 


LAMA2 


GDB: 132362 


T AH iTTXTTXT A T TkTT A O. T ATViTA'^ 

LAMINLN, ALPHA 2; LAMA2 


T A Tl 

LAP 


GDB:9958992 


LARYNGEAL ADDUCTOR 
PARALYSIS; LAP 


LCA5 


GDB:11498764 


AMAUROSIS CONGENITA OF LEBER I 


LPA 


GDB: 120699 


APOLIPOPROTEIN(a); LPA 


MCDRl 


GDB:131406 


MACULAR DYSTROPHY, RETINAL, 1, 
NORTH CAROLINA TYPE; MCDRl 


MOCSl 


GDB:9862235 


MOLYBDENUM COFACTOR 
DEnCIENCY 


MUX 


GDB:120204 


METHYLMALONICACIDURIA DUE 
TO METHYLMALONIC CoA MUTASE 
DEFICIENCY 


MYB 


GDB:1 19441 


V-MYB AVIAN MYELOBLASTOSIS 
VIRAL ONCOGENE HOMOLOG; MYB 


NEUl 


GDB:120230 


NEURAMINIDASE DEFICIENCY 


NKSl 


GDB:128100 


SUSCEPTIBILrrY TO LYSIS BY 
ALLOREACTIVE NATURAL KILLER 

CELLS; ECl 


NYS2 


GDB:9848763 


NYSTAGMUS, CONGENITAL 
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Gene 


GDB Accession. ID 


OMIMLink ^T" " 


OA3 


GDB: 136429 


ALBINISM, OCULAR, AUTOSOMAL 


\jUUiJ 




OPT TT nnPxrmnTnTT a t tivqpt acta. 
ODDD 




KjiJD, lZUZ*f / 




PARK2 


GDB:6802742 


PARKINSONISM, JUVENILE 


PBCA 


GDB:9956321 


BETA CELL AGENESIS WITH 

"MTJr^XTATAT TIT A'D'RT'DC "NyTDT T TTTTC 




\3Ud.j /Ojjjj 


m'm?Tm?T7TTKTAT ATT>rkDTTV 

PROGRESSIVE BIFOCAL; CRAPB 




LrJJ±$.13o34y 


l^loxSAol^ KJr oUJNli; rDr> 


PEX3 


GDB:9955507 


ZELLWEGER SYNDROME; ZS 


PEX6 


GDB:5592414 


ZELLWEGER SYNDROME; ZS 
PEROXIN-6; PEX6 


PEX7 


GDB:6155803 


RHIZOMEUC CHONDRODYSPLASIA 
PUNCTATA^ RCDP PEROXIN-7; PEX7 


PKHDl 


GDB:433910 


POLYCYSTIC KIDNEY AND HEPATIC 

FlTCTJ A C"P 1 • "DVXmi 

UloHAoil-l, irJs±LLIl 


PLA2G7 


GDB:9958829 


PLATELET-ACTIVATING FACTOR 


PLG 


GDB:1 19498 


PLASMINOGEN; PLG 


POLH 


GDB:6963323 


PIGMENTOSUM WITH NORMAL DNA 
REPAIR RATES 


PPAC 


GDB:9956248 


ARTHROPATHY. PROGRESSIVE 
PSEUDORHEUMATOID, OF 

/'111 1 i T^XT/^r^T\ 


Ji'oUKol 


LjiJxJ.OJoiilv/ 


'DCrf^'DT A OTO OT TC/^Tj'D'T'FDTT T'T'\7" T"/^ 

rbUKlAiSio, 0 U ISCbr 1 ioiLll Y 1 U 


PUJO 


GDB:9956231 


MULTICYSTIC RENAL DYSPLASIA, 

"DTT A TCD AT. A4T>T\ 


RCDl 


GDB:333929 


RETINAL CONE DEGENERATION 


RDS 


GDB:118863 


RETINAL DEGENERATION, SLOW; 
RDS 


RHAG 


GDB:136011 


RHESUS BLOOD 
GROUP-ASSOCIATED 
GLYCOPROTEIN; RHAG RH-NULL, 
REGULATOR TYPE; RHN 


RP14 


GDB:433713 


RETINITIS PIGMENTOSA-14; RP14 
TUBBY-LIKE PROTEIN 1; TULPl 


RUNX2 


GDB:392082 


CLEIDOCRANIAL DYSPLASIA; CCD 
CORE-BENDING FACTOR, RUNT 

DOMAIN, ALPHA SUBUNIT 1; CBFAl 


RWS 


GDB:9956195 


SENSITIVITY 
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Gene 


GDB Accession ID 


1 U!" 'Ujf — tt !<■ 'Ui! Uudi- luii J! lu^ »^ ij^ji iKUl i 

OMDMLink 


Qf^ A 1 


yjUD.lLVjoo 




SCZD3 


GDB:635974 


DISOKDER-3; SCZD3 


SIASD 


GDB:433552 


SIALIC ACID STORAGE DISEASE; 






MITOCHONDRIAL; S0D2 


ST8 


GDB:6118456 


OVARIAN TUMOR 


TAPl 


GDB: 132668 


•raANSPORTER 1, ABC; TAPl 


TAP2 


GDB:132669 


TRANSPORTER 2, ABC; TAP2 


TFAP2B 


GDB:681506 


DUCTUS ARTERIOSUS; PDA 
TRANSCRIPTION FACTOR AP-2 


TNDM 


GDB:9956265 


DIABETES MELLrrUS, TRANSIENT 

XTPnxr A T A T 


TNF 


GDB: 120441 


TUMOR NECROSIS FACTOR; TNF 


TPBG 


GDB:125568 


TROPHOBLAST GLYCOPROTEIN; 

IJriSVj, MOJrl 


TPMT 


GDB:209025 


THIOPURINE 

S-METHYLTRANSFERASE; TPMT 


TULPl 


GDB:6199353 


TUBBY-LKE PROTEIN 1; TULPl 


WISPS 


GDB:9957361 


ARTEIROPATHY, PROGRESSIVE 
PSEUDORHEUMATOID, OF 
CHILDHOOD 



Table 8: Genes, Locations and Genetic Disorders on Chromosome 7 



Gene 


GDB Accession ID 


OMIM Link 


AASS 


GDB:1 1502144 


HYPERLYSINEMIA 


ABCBl 


GDB:120712 


P-GLYCOPROTEIN-1; PGYl 


ABCB4 


GDB:120713 


P-GLYCOPROTEIN-3; PGY3 


ACHE 


GDB:1 18746 


ACETYLCHOLINESTERASE BLOOD 
GROUP-Yt SYSTEM; YT 


AQPl 


GDB:129082 


AQUAPORIN-1; AQPl BLOOD 
GROUP-COLT0N;CO 


ASL 


GDB:1 19703 


ARGININOSUCCINICACIDURIA 


ASNS 


GDB:1 19706 


ASPARAGINE SYNTHETASE; ASNS; AS 


AUTSl 


GDB:9864226 


DISORDER 


BPGM 


GDB:1 19039 


DIPHOSPHOGLYCERATE MUTASE 
DEFICIENCY OF ERYTHROCYTE 


C7orf2 


GDB:10794644 


ACHEIROPODY 
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Gene 


GDB Accession ID 


OMIMLulk 


CACNA2D1 


GDB:132010 


CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, L TYPE, 
ALPHA-2yDELTA SUBUNIT; 
MALIGNANT HYPERTHERMIA 


CCMl 


GDB:580824 


CEREBRAL CAVERNOUS 
MALFORMATIONS 1; CCMl 


CD36 


GDB:138800 


CD36 ANTIGEN; CD36 


CFTR 


GDB:120584 


CYSTIC FIBROSIS; CF DEFERENS, 
CONGENITAL BILATERAL APLASIA 


CHORDOMA 


GDB:1 1498328 




CLCNl 


GDB:134688 


CHLORIDE CHANNEL 1, SKELETAL 
MUSCLE; CLCNl 


CMH6 


GDB:9956392 


CARDIOMYOPATHY, FAMILIAL 
HYPERTROPHIC, WITH 
WOLFF-PARKDSrSON-WHTTE 


CMT2D 


GDB:9953232 


CHARCOT-MARIE-TOOTH DISEASE, 
NEURONAL TYPE, D 


COL1A2 


GDB: 119062 


COLLAGEN, TYPE I, ALPHA-2 
POLYPEPTIDE; COL1A2 
OSTEOGENESIS IMPERFECTA TYPE I 

OQTTnOrilTNTRQTQ ITVAP'R'DTTPr^T A TVPT? 
Wo i Xi^JlJXlXNllolD MVlJrllJxril^iA 1 xsfCt 

IV; 014 


CRS 


GDB:119073 


CRANIOSYNOSTOSIS, TYPE 1; CRSl 


CYMD 


GDB:366594 


MACULAR EDEMA, CYSTOID 


DFNA5 


GDB:636174 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 5; 
DENA5 


DLD 


GDB:120608 


LIPOAMIDE DEHYDROGENASE 
DEFICIENCY, LACTIC ACIDOSIS DUE 
TO 


DYTll 


GDB:10013754 


MYOCLONUS, HEREDITARY 
ESSENTIAL 


EECl 


GDB:136338 


ECTRODACTYLY, ECTODERMAL 
DYSPLASIA, AND CLEFT LIP/PALATE; 


T3T XT 




■PT A QTTM* "PT XT \kTXX T T AAVTQ 151711013X1 

SYNDROME; WBS 


ETVl 


GDB:335229 


ETS VARIANT GENE 1 ; ETVl 


FKBP6 


GDB:9955215 


WILLIAMS-BEUREN SYNDROME; WBS 


GCK 


GDB:127550 


DIABETES MELLITUS, AUTOSOMAL 
DOMINANT, TYPE n GLUCOKINASE; 
GCK 
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Gene 


GDB Accession ID 


OMDMLm]^ 


GHRHR 


GDB:138465 


GROWTH HORMONE-RELEASING 

HORMONE RECEPTOR; GHRHR 


GHS 


GDB:9956363 


MICROSOMIA WITH RADIAL 
DEFECTS ■ 


GLD 


GDB: 119990 


PALLISTER-HALL SYNDROME; PHS 
GLI-KRUPPiEL FAMILY MEMBER 3; 
GLD POSTAXIAL POLYDACTYLY, 
TYPEAIGREIG 
CEPHALOPOLYSYNDACTYLY 
SYNDROME; GCPS 


GPDSl 


GDB:9956410 


GLAUCOMA, PIGMENT-DISPERSION 
TYPE 


GUSB 


GDB:120025 


MUCOPOLYSACCHARIDOSIS TYPE 

vn 


HADH 


GDB:120033 


HYDROXYACYL-CoA 
DEHYDROGENASE/3-KETOACYL-CoA 
TEHOLASE/ENOYL-CoA HYDRATASE, 


HLXB9 


GDB:136411 


HOMEO BOX GENE HB9; HLXB9 
SACRAL AGENESIS, HEREDITARY, 
Wrra PRESACRAL MASS, ANTERIOR 

MENINGOCELE, 


H0XA13 


GDB: 120656 


HOMEO BOX A13; H0XA13 


HPFH2 


GDB:128071 


HEREDITARY PERSISTENCE OF 
rxi 1 AJL JnUijyiUwJLV^JDliN, 
HETEROCELLULAR, INDIAN 




/^TYD •OO^fiOQO 


TTDY 
XlKA. 


T A 13 

lArJ 


vjDJd : 1 i4yoyuy 


AXITTTTDVOAyr TMTD A/^^D AXTTAT Tl'C'D'D'V 


IMMrzL 


trUJi : 1 1 4y y 1 y D 


/~^TT T TJO T\U T A T'/^T TT> Li" 1 " 1 ' U 

Lrl.Lij;K& ur, LA lUUKcl Lt, 
SYNDROME; GTS 


KCNH2 


GDB:138126 


LONG QT SYNDROME, TYPE 2; LQT2 


LAMBl 


GDB:119357 


LAMININ BETA 1; LAMBl 


LEP 


GDB: 136420 


LEPTIN; LEP 


MET 


Ca3B:120178 


MET PROTO-ONCOGENE; MET 


NCFl 


GDB: 120222 


GRANULOMATOUS DISEASE, 
CHRONIC, AUTOSOMAL 
CYTOCHROME-b-POSmVE FORM 


NM 


GDB:119454 


NEUTROPHIL CHEMOTACTIC 


OGDH 


GDB:118847 


ALPHA-KETOGLUTARATE 


OPNISW 


GDB: 119032 


TRITANOPIA 


PEXl 


GDB:9787110 


ZELLWEGER SYNDROME; ZS 
PEROXDSr-l; PEXl 
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Gene 


GDB Accession ID 


OMBfLink 


PGAM2 


GDB:120280 


PHOSPHOGLYCERATB MUTASE, 

"nPTTTPTP'Nrr' V OTT M QT fRT TWTT CW 
jjxir iUJJ-'iN I yjr iVi oUi5Ui\ii yjr 


PMS2 


GDB:386406 


POSTMEIOTIC SEGREGATION 
INCREASED (S. CEREVISIAE)-2; PMS2 


PONl 


GDB: 120308 


PARAOXONASE 1; PONl 


PPP1R3A 


GDB: 136797 


PROTEIN PHOSPHATASE 1, 
REGULATORY (INHIBrrOR) SUBUNTT 
i; rrr Ixo 


PRSSl 


GDB:119620 


PANCREATITIS, HEREDITARY; PCTT 
rssSJ 1 bAoJtS, oJiKJJNxi, 1 ; rKoo i 


PTC 


1 on A A 

CjDB:11o744 




PTPN12 


GDB: 136846 


TJ'Di^T'DTNT T'VTJr^CTKTD 'DTJAC'DtT A T A CI? 

NONRECEPTOR TYPE, 12; PTPN12 


RPIO 


GDB:13878o 


TDcrrMTTTC 'DTniVirmv'rr<~kO Am. vdi a 
KiillNXlio I'lvrMcJNitJoA-iU; KrlU 


RP9 


GDB:333931 


RETINmS PIGMENTOSA-9; RP9 


SERPINEl 


GDB:120297 


PLASMINOGEN ACTIVATOR 
INHIBITOR, TYPE I; PAH 


SGCE 


GDB:9958714 


MYOCLONUS, HEREDITARY 
ESSENTIAL 


SHFMl 


GDB:128195 


SPLir-HAND/FOOT DEFORMITY, TYPE 
tSHFDl 


SHH 


GDB:456309 


HOLOPROSENCEPHALY, TYPE 3; HPE3 
SONIC HEDGEHOG, DROSOPHILA, 
HOMOLOG OF; SHH 


SLC26A3 


GDB:138165 


DOWN-REGULATED IN ADENOMA; 
DRA CHLORIDE DIARRHEA, 
FAMILIAL; OLD 


SLC26A4 


GDB:5584511 


PENDRED SYNDROME; PDS 
DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 4; DFNB4 


SLOS 


GDB:385950 


SMira-LEMLI-OPITZ SYNDROME 


SMADl 


GDB:3763345 


SPINAL MUSCULAR ATROPHY, 
DISTAL, WITH UPPER LIMB 
PREDOMINANCE; SMADl 


TBXASl 


GDB:128744 


THROMBOXANE A SYNTHASE 1; 
TBXASl 


TWIST 


GDB: 135694 


ACROCEPHALOSYNDACTYLY TYPE 
m TWIST, DROSOPHILA, HOMOLOG 
OF; TWIST 


ZWSl 


GDB:120511 


ZELLWEGER SYNDROME; ZS 



Table 9: 



Genes, Locations and Genetic Disorders on Cbronaosome 8 
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Gene 


GDB AccessionD) 


OMEVf Link 


ACHM3 


GDB:9120558 


PINGELAPESE BLINDNESS 


ADRB3 


GDB:203869 


BETA-3-ADRENERGIC RECEPTOR; ADRB3 


ANKl 


GDB:1 18737 


SPHEROCYTOSIS, HEREDITARY; HS 


CAl 


GDB:1 19047 


CARBONIC ANHYDRASE I, 
ERYTHROCYTE, ELECTROPHORETIC 
VARIANTS OF; CAl 


CA2 


GDB:1 19739 


OSTEOPETROSIS WITH RENAL TUBULAR 
ACIDOSIS 


CCALl 


GDB:512892 


CHONDROCALCINOSIS WTIH 
EARLY-ONSET OSTEOARTHRITIS; CCAL2 


CLN8 


GDB:252118 


EPILEPSY, PROGRESSIVE, WITH MENTAL 

Kb 1 AtCUAl lUJN 5 lir JVLK. 


CMT4A 


GDB:138755 


CHARUU 1 -MAKlii- 1 UU 1 i± JN JiU xCUr A 1 xl Y 
4A; CMT4A 


CNGB3 


GDB:9993286 


PINGELAPESE BLINDNESS 


COHl 


GDB:2S2122 


i^Unhri oYrsUKUMc; CUiil 


CPP 


GDB:1 19798 


CERULOPLASMIN; CP 


CRH 


GDB:1 19804 


CORTICOTROPIN-RELEASING HORMONE; 
CRH 


CYPUBl 


GDB:120603 


ADRENAL HYPERPLASIA, CONGENITAL, 
DUE TO 11-@BETA-HYDR0XYLASE 
DEFICIKNCY 


CYP11B2 


GDB:120S14 


/^'\7*T'/^/^TTD rw JTC "QA C A CT TD'D A "WTTT "V "VTD 

CYluUHKUMc Jr4505 oUorAJVLLLY Aid, 

POLYPEPTIDE 2; CYPl 1B2 


DECRl 


GDB:453934 


2A@DIENOYL-CoA REDUCTASE; DECK 


DPYS 


GDB:5885803 


DIHYDROPYRIMIDINASE; DPYS 


DURSl 


GDB:9958126 


T\TTAXTD OX/XTT\T> /^li iTD 

DU ANB SYNDROME 


EBSl 


GDB:1 19856 


EPIDERMOLYSIS BULLOSA SIMPLEX, 
OGNA TYPE 


ECAl 


GDB:10796318 


JUVENILE ABSENCE 


EGI 


GDB:12883Q 


EPILEPSY, GENERALIZED, IDIOPATHIC; 
EGI 


EXTl 


GDB:135994 


EXOSTOSES, MULTIPLE, TYPE I; EXTl 
CHONDROSARCOMA 


EYAl 


GDB:5215167 


BRANCHIOOTORENAL DYSPLASIA EYES 
ABSENT 1; EYAl 


FGFRl 


GDB:1 19913 


ACROCEPHALOSYNDACTYLY TYPE V 
FIBROBLAST GROWTH FACTOR 

lSJi\^EiC x\JI\rlf PUTXvl 


GNRHl 


GDB:133746 


GONADOTROPIN-RELEASING HORMONE 1; 
GNRHl FAMILL\LHYPOGONADOTROPHIC 


GSR 


GDB:1 19288 


GLUTATHIONE REDUCTASE; GSR 
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Gene 


GDB AccessionID 


OMEMLink 


GULOP 


GDB: 128078 


SCURVY 


HR 


GDB:595499 


ALOPECIA UNIVERSALIS ATRICHIA WITH 
PAPULAR LESIONS HAIRLESS, MOUSE, 
HOMOLOGOF 


KCNQ3 


GDB:9787230 


CONVULSIONS, BENIGN FAMILIAL 
NEONATAL, TYPE 2; BFNC2 POTASSIUM 

r'WA'MMTIT VCVCTArSB nkTWn CTTDU AlV>nT V 

l^xlAJNiNllJ-., VL^JjlAlib-uAlliU, oUxSrAMlLiY 
Q, MEMBER 3 


KFM 


GDB:265291 


KLIPPEL-FEIL SYNDROME; KFS; KFM 


KWE 


GDB:9315120 


KERATOLYTIC WINTER ERYTHEMA 


LGCR 


GDB:120698 


LANGER-GIEDION SYNDROME; LGS 


LrL 


ljrLlJt5:lZU/UU 


ri X Jr JlKijJLr L^JrKlJ i JtSUN JIJVLIA, 1 x Jr Ji i 


MCPHl 


GDB:9834525 


MICROCEPHALY; MCT 


MOS 


GDB: 119396 


TRANSFORMATION GENE: ONCOGENE 

JVLUo 






TT? AXTCT7m>"N>rATrrrYKr nuMT?. riXTr*nnT7Kn3 
iJvAJNorUJvlVlAiilJJN IjnJNiS: UJNUUUHiNJi 

MYC;MYC 


NAT! 


GDB: 125364 


ARYLAMIDE ACETYLASE 1; AACl 


NAT2 


GDB: 125365 


ISONIAZID INACnVATION 


NBSl 


GDB:9598211 


NIJMEGEN BREAKAGE SYNDROME 


PLAT 


GDB:1 19496 


PLASMINOGEN ACTIVATOR, TISSUE; PLAT 


PLECl 


GDB:41 19073 


EPIDERMOLYSIS BULLOSA SIMPLEX AND 
LIMB-GIRDLE MUSCULAR DYSTROPHY 
PLECTIN 1; PLECl 


PRKDC 


GDB:234702 


SEVERE COMBINED IMMUNODEFICIENCY 
DISEASE-1; SCIDl PROTEIN KINASE, 
DNA-ACTIVATED, CATALYTIC SUBUNIT; 
rKivLlU 


rAMr3 


vjUd:1J14&/ 


SYNDROME; ZS 


DDI 




PTfTTXTI'IMQ "DT/^XirnXIT/^O A 1 > 'DDI 

KciilNilld JrlCrMiiJNlUoA-1, Krl 




Kjuo.yoo't /jo 




SFTPC 


(353:120373 


PULMONARY SURFACTANT APOPROTEIN 


SGMl 


GDB: 135350 


KLIPPEL-FEIL SYNDROME; KFS; KFM 


SPG5A 


GDB:250332 


SPASTIC PARAPLEGIA-5A, AUTOSOMAL 

"DT7r'T7CGT\/n» C!T>n^ A 
KllL/iiSoolVil, oirLrDA 


STAR 


GDB:635457 


STEROIDOGENIC ACUTE REGULATORY 

PT?nTFT>J- ^TAP 

xXxA^XXLllNj OXr\i\. 


TG 


GDB: 120434 


THYROGLOBULDST; TG 


TRPSl 


GDB:594960 


TRICHORHINOPHALANGEAL SYNDROME, 
TYPE I; TRPSl 
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Gene 


GDB AccessionlD 


OMIMLinfc — 


TTPA 


GDB:5 12364 


VITAMIN E, FAMILIAL ISOLATED 
DEFICIENCY OF; VED TOCOPHEROL 
CALPHA") TRANSFER PROTEIN- TTPA 


VMDl 


GDB:119631 


MACULAR DYSTROPHY, ATYPICAL 
VTTELLIFORM; VMDl 


WRN 


GDB:128446 


WERNER SYNDROME; WRN 



Table 10: Genes, Locations and Genetic Disorders on Chromosome 9 



Gene 


GDB AccessionlD 


OMIMLink 


ABCAl 


GDB:305294 


ANALPHALIPOPROTEINEMIA 
ATP-BINDING CASSETTE 1; ABCl 


ABLl 


GDB:1 19640 


ABELSON MURINE LEUKEMIA VIRAL 
ONCOGENE HOMOLOG 1; ABLl 


ABO 


GDB:118956 


ABO BLOOD GROUP; ABO 


ADAMTS13 


GDB:9956467 


THROMBOCYTOPENIC PURPURA 


AKl 


GDB:1 19664 


ADENYLATE KINASE-1; AKl 


ALAD 


GDB: 119665 


DELTA-AMINOLEVULINATE 
DEHYDRATASE; ALAD 


ALDHlAl 


GDB 


119667 


ALDEHYDE DEHYDROGENASE-1; ALDHl 


ALDOB 


GDB 


119669 


FRUCTOSE INTOLERANCE, HEREDITARY 


AMBP 


GDB 


120696 


PROTEIN HC;HCP 


AMCDl 


GDB 


437519 


ARTHROGRYPOSIS MULTIPLEX 
CONGENITA DISTAL, TYPE 1; AMCDl 


ASS 


GDB:1 19010 


CTTRULLINEML^ 


BDMF 


GDB:9954424 


BONE DYSPLASIA WITH MEDULLARY 
FIBROSARCOMA 


BSCL 


GDB:9957720 


SEIP SYNDROME 


C5 


GDB:1 19734 


COMPLEMENT COMPONENT-5, 
DEHCIENCYOF 


CDKN2A 


GDB:335362 


MELANOMA CUTANEOUS MALIGNANT, 2; 
CMM2 CYCLIN-DEPENDENT KINASE 
INHTOirOR 2A; CDKSr2A 


CHAC 


GDB:6268491 


CHOREOACANTHOCYTOSIS; CHAC 


CHH 


GDB:138268 


CARTILAGE-HAIR HYPOPLASL^; CHH 


CMDIB 


GDB:677147 


CARDIOMYOPATHY, DILATED IB; CMDIB 


C0L5A1 


GDB:131457 


COLLAGEN, TYPE V, ALPHA-1 
POLYPEPTIDE; C0L5A1 


CRAT 


GDB:359759 


CARNITINE ACETYLTRANSFERASE; CRAT 


DBH 


GDB:1 19836 


DOPAMINE BETA-HYDROXYLASE, 
PLASMA; DBH 
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Gene 


GDB AccessionlD 


OMIMLink 


DFNBll 


GDB: 1220180 


DEAFNESS, NEUROSENSORY, 

/\U 1 UoLliVLAi-* ivJii^iioMV^^ /,JJrJNJo/ 






rUHATTNTnCC XTRTTPr^QPXTCnP V 
jJJi/VriNJcioo, JNliUJvVJoJDiNijUJxx 5 

AUTOSOMAL RECESSIVE, 7; DFNB7 


DNAIl 


GDB:11500297 


MMOTILE CILIA SYNDROME-1; ICSl 


DYS 


GDB: 137085 


DYS AUTONOMIA, FAMILIAL; DYS 


JJ I 1 1 




"nVQTnXTTA 1 Tr\T> Qir\K{» "RVTI 
Uiol UrsiA 1, 1 UJxolUJN, U I 1 i 


ENG 


GDB:137193 


ENDOGLIN; ENG 


EPB72 


GDB:128993 


ERYTHROCYTE SURFACE PROTEIN BAND 
/.Z, Jirii /z o 1 UMAl UC Y 1 Uoio 1 


FANCC 


GDB:132672 


FANCONI ANEMIA, COMPLEMENTATION 
vrKUUx* rAUv^ 




/TT^'D • 1 /1 1 <10 


T7DT T/^T/^OTJ 1 ^ 'DTC'DXJ/^'DTT A T* A CC 1 . "DXaTil 

r xCUU 1 Uo Ji- 1 jO-olorxiUlrjlA 1 AJSxi 1 ; tor 1 




LrDx5:zMJ41z 


"DT TV J TV A TVyf A TVOTJ /^r\XTO"CXTTT' A T 

MUSCULAR DYSTROPHY; FCMD 


rJKJJA 


r^T^TJ. 1 1 QQ^I 


T7T> TC'r\'D "DT/^XJ ATA VT A 1 . T7*DT\ A 1 

rKiiiJJKblL^Jl Al AAIA 1; rKUAl 


AT T 

(jrALl 


lyy /I 


A T A /^'T/^CTJlVyrT A 




/^TM5»1 OC/C1 1 

vjUo.izoOli 


MYrJDKCjL.YCljNiiJVllA^ loULAixiU 
NONKETOTIC, TYPE I; NKHl 


GNE 


GDB:9954891 


INCLUSION BODY MYOPATHY; IBM2 




<T.'r»D.Q'70/l'i1 rt 

LrUx5:7 /o4zlU 


ucJNiUoJrAoM 1; CroMl 


GSN 


GDB: 120019 


AMYLOIDOSIS V GELSOLIN; GSN 


HSD17B3 


GDB:347487 


PSEUDOHERMAPHRODITISM, MALE. WIIH 

Vj Y JN JiUUMAo i lA 


HSNl 


GDB:3853677 


NEUROPATHY, HEREDITARY SENSORY, 
1 YrJb 1 


IBM2 


GDB:3801447 


INCLUSION BODY MYOPATHY; IBM2 


LALL 


GDB:9954426 


LEUKEMIA, ACUTE, WITH 

T "VTVVrDXJ/^A JT A T*f\T TC "CTC A TT TD'DO . T A T T 

LrYMrHUMAlUUIS rJbAlUKco; LAl^L 


T r^i^Q 


vji->o:Jooi4l 


T "CTTU A T r^/^XT/^'DXTTT' A T /^/^XTP^ A /^TTTD'D 

L/blilAL CUJNCjhJNiiALCUNiKAClUKJi 
SYNDROME; LCCS 


iAjNiDzti 




iJYoiKUx'xlY, rlU 1 liiKilii lYrxi 


T A/TVI X> 

J-»JVlAlr> 




XT ATT 13 A T'DT T A C\/XTT\T>/^TV/fX?. XTDC1 

JNAlL-FAlbLLA iSYJNDKUMJb; JNriSl 


MLLT3 


GDB: 138172 


MYELOID/LYMPHOID OR MIXED LINEAGE 
LiCUJvcMlA, iKAMoLUUAicD lU, i; ML1,13 


MROS 


GDB:9954430 


MELKERSSON SYNDROME 


MSSE 


GDB:128019 


EPITHELIOMA, SELF-HEALING 


NOTCHl 


GDB:131400 


NOTCH, DROSOPHILA, HOMOLOG OF, 1; 
NOTCHl 


ORMl 


GDB:120250 


OROSOMUCOID 1; ORMl 
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Gene 


ivUxs ACcessionjLLi 




PAPPA 


GDB:134729 


PREGNANCY-ASSOCIATED PLASMA 
PROTEIN A; PAPPA 


PP5K1B 


GDB:686238 


FRIEDREICH ATAXIA 1 ; FRDAl 


PTCH 


GDB:1 19447 


BASAL CELL NEVUS SYNDROME; BCNS 
PATCHED, DROSOPHILA, HOMOLOG OF; 


T>T/T.01 
r IKJoi 




rXUo 1 AurJLJyNiJlN-JbJNDOrEKOXiDASE 
SYNTHASE 1;PTGS1 


RLNl 


GDB:1 19552 


RELAXIN;RLN1 


RLN2 


GDB:1 19553 


RELAXESr, OVARIAN, OF PREGNANCY 


RMRP 


GDB:120348 


MITOCHONDRIAL RNA-PROCESSING 
ENDORIBONUCLEASE, RNA COMPONENT 
OF; RMRP; CARTILAGE-HAIR 
HYPOPLASIA CHH 


R0R2 


GDB:136454 


BRACHYDACTYLY, TYPE B; BDB 
ROBINOW SYNDROME, RECESSIVE FORM 
NEUROTROPHIC TYROSESIE KINASE, 
RbCEPlOR-RELATED 2; NTRKR2 


KrlJl 




RETHSlTlb PIGMENTOSA-DEAFNESS 
SYNDROME 1, AUTOSOMAL DOMINANT 


SARDH 


GDB:9835149 


SARCOSINEMIA 


TDFA 


GDB:9954420 


FACTOR, AUTOSOMAL 


TEK 


GDB:344185 


VENOUS MALFORMATIONS, MULTIPLE 
CUTANEOUS AND MUCOSAL; VMCM TEK 
TYROSINE KINASE, ENDOTHELIAL; TEK 




\JUD,La>\J 1 DD 




TYRPl 


GDB:126337 


TYROSINASE-RELATED PROTEIN 1; TYRPl 
ALBINISM m XANTHISM 


XPA 


GDB:125363 


XERODERMA PIGMENTOSUM I 



Table 1 1 : Geaes, Locations and Genetic Disorders on Chromosome 10 



Gene 


GDB Accession ID 


OMBILink 


CACNB2 


GDB:132014 


CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, BETA-2 
SUBUNTT; CACNB2 


C0L17A1 


GDB:131396 


COLLAGEN, TYPE XVII, ALPHA-1 
POLYPEPTIDE; C0L17A1 


CUBN 


GDB:636049 


MEGALOBLASTIC ANEMIA 1; MGAl 


CYP17 


GDB:1 19829 


ADRENAL HYPERPLASIA, 
CONGENITAL, DUE TO 
17-ALPHA-HYDROXYLASE 
DEHCIENCY 
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Gene 


GDB Accession ID 




CYP2C19 


GDB:1 19831 


CYTOCHROME P450, SUBFAMILY 

nC, POLYPEPTIDE 19; CYP2C19 


CYP2C9 


GDB:131455 


CYTOCHROME P450, SUBFAMILY 
nC, POLYPEPTIDE 9; CYP2C9 


EGR2 


GDB: 120611 


EARLY GROWTH KESPONSE-2; 


EMX2 


GDB:277886 


EMPTY SPIRACLES, DROSOPHILA, 
2, HOMOLOG OF; EMX2 


EPT 


GDB:9786112 


EPILEPSY, PARTIAL; EPT 


ERCC6 


GDB:1 19882 


EXCISION-REPAIR 
CROSS-COMPLEMENTING RODENT 
REPAIR DEFICIENCY, 
COMPLEMENTATION 


FGFR2 


GDB:127273 


ACROCEPHALOSYNDACTYLY 
1 Yrxi V rLDJvUoLAJSl CjRUWTH 
FACTOR RECEPTOR-2; FGFR2 






rltiA.UJsJJNAiSJb-1; HK.1 


HOXll 


GDB: 119607 


HOMEO BOX-1 1; HOXl 1 


HPS 


GDB: 127359 


HERMANSKY-PUDLAK 
or JnJJK.UMx1; liro 


TT OP A 


<T.'r»'D.1 1 09^< 

yjUo.lLyi'tD 


UN lliKLllUKlJN-2 KBCnPTOR, 
ALPHA; IL2RA 


LGIl 


GDB:9864936 


EPILEPSY, PARTIAL; EPT 


LIPA 


GDB:120153 


WOLMAN DISEASE 


MATIA 


GDB:129077 


METHIONINE 

ADENOSYLTRANSFERASE 

DEHCIENCY 


MBL2 


GDB:120167 


MANNOSE-BINDING PROTEIN, 
SERUM; MBPl 


MBCI67 


GDB:120185 


PROLIFERATION-RELATED Ki-67 
ANTIGEN; MKI67 


MXIl 


GDB:137182 


MAX INTERACTING PROTEIN 1 ; 
MXIl 


OAT 


GDB:120246 


ORNITHINE AMINOTRANSFERASE 
DEFICIENCY 


0ATL3 


GDB:215803 


OKNTTHIISIE AMINOTRANSFERASE 
DEFICIENCY 


PAX2 


GDB:138771 


PAIRED BOX HOMEOTIC GENE 2; 
PAX2 


PCBD 


GDB:138478 


PTERIN-4-ALPHA-CARBINOLAMINE 
DEHYDRATASE; PCBD 
PRIMAPTERINURIA 


PEOl 


GDB:632784 


PROGRESSIVE EXTERNAL 
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Gene 


GDB Accession ID 


OMDViliiik " ■■^■-•-^ 






OPHTHALMOPLEGIA; PEO 


PHYH 


GDB:9263423 


KEFSUM DISEASE 

PHYTANOYL-CoA HYDROXYLASE; 
PHYH 

X XX X x^ 


PNLIP 


GDB:127916 


LIPASE, CONGENITAL ABSENCE OF 
PANCREATIC 


PSAP 


GDB:120366 


PROSAPOSIN; PSAP 


PTEN 


GDB:6022948 


MACROCEPHALY, MULTIPLE 
LIPOMAS AND HEMANGIOMATA 
MULTIPLE HAMARTOMA 
SYNDROME; MHAM POLYPOSIS, 
JUVENILE INTESTINAL 
PHOSPHATASE AND TENSIN 
HOMOLOG; PTEN 


RBP4 


GDB:120342 


RETBSrOL-BINDING PROTEIN, 
PLASMA; RBP4 


RDPA 


GDB:9954445 


REFSUM DISEASE WITH 
INCREASED PIPECOLICACIDEMIA; 

S\±JJrJ\. 


RET 


GDB:120346 


RET PROTO-ONCOGENE; RET 


SDFl 


GDB:433267 


STROMAL CELL-DERIVED FACTOR 
1; SDFl 


SFTPAl 


GDB:1 19593 


PULMONARY SURFACTANT 
APOPROTEIN PSP-A; PSAP 


SFTPD 


GDB:132674 


PULMONARY SURFACTANT 

APOPPOTT7TM PCD "Tk* PCD T\ 


SHFM3 


GDB-3860^0 


QPT TT TTAxm/TTr^Pinr 
»jx J-ri 1 -rLfVLNi-i/p iJU 1 

MALFORMATION, TYPE 3; SHFM3 


SIAL 


GDB:6549924 


NEURAMINIDASE DEFICIENCY 


THC2 


GDB:10794765 


THROMBOCYTOPENIA 


TISIFRSF6 


GDB:132671 


APOPTOSIS ANTIGEN 1- APTl 


UFS 


GDB:6380714 


UROFACIAL SYNDROME; UFS 


UROS 


GDB: 128112 


PORPHYRIA, CONGENITAL 
ERYTHROPOIETIC; CEP 



Table 12: Genes, Locations and Genetic Disorders on Chromosome 1 1 



Gene 


GDB Accession ID 


OMIMLink 


AA 


GDB:568984 


ATROPHIA AREATA; AA 


ABCC8 


GDB:591370 


SULFONYLUREA RECEPTOR; SUR 
PERSISTENT HYPERINSULINEMIC 
HYPOGLYCEMIA OF INFANCY 
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Gene 


GDB Accession ID 


OMEMLink 


Ar'ATl 


\JUO. l^OoO 1 


AT PITA 'N/rFTRVT APRTHAP'RTTr'Ar'TnTTDTA 


AT YA 


VJUJD • JL KJj 




AMPD3 


GDB:136013 


ADENOSINE MONOPHOSPHATE 
DEAMINASE-3; AMPD3 


ANC 


GDB:9954484 


CANAL CARCINOMA 


APOAl 


GDB:1 19684 


AMYLOIDOSIS, FAMILIAL VISCERAL 
LIPOPROTEIN; APOAl 


AP0A4 


GDB: 119000 


APOLIPOPROTEIN A-IV; AP0A4 


AP0C3 


GDB:119001 


APOLIPOPROTEIN C-IE; AP0C3 


AIM 




AT A VTA HHRT AXmTROTACTA- AT 


BSCL2 


GDB:9963996 


SEIP SYNDROME 


BWS 


GDB:120567 


BECKWITH-WIEDEMANN SYNDROME; 

"DWG 


CALCA 


GDB:120571 


CALCITONlN/CALCrrONlN-RELATED 

POT VT5T7PTTnP A T PTT A • P A T A 


CAT 


GDB:119049 


CATALASE; CAT 


CCNDl 


GDB:128222 


LEUKEMIA, CHRONIC LYMPHATIC; CLL 
CYCLED Dl; CCNDl 


CD3E 


GDB:1 19764 


CD3E ANTIGEN, EPSILON POLYPEPTIDE; 

CD3E 


CD3G 


GDB: 119765 


T3 T-CELL ANTIGEN, GAMMA CHAIN; 
T3G; CD3G 


CD59 


GDB:119769 


CD59 ANTIGEN P18-20; CD59 HUMAN 
LEUKOCYTE ANTIGEN MICH; MICl 1 


CDKNIC 


GDB:593296 


CYCLIN-DEPENDENT KINASE INHIBITOR 


CLN2 


GDB:12522S 


CEROID-LIPOFUSCINOSIS, NEURONAL 2, 
LATE INFANTILE TYPE; CLN2 


CNTF 


GDB:125919 


CILLERY NEUROTROPHIC FACTOR; CNTF 


CPTIA 


GDB:597642 


HYPOGLYCEMIA, HYPOKETOTIC, WITH 
DEHCIENCY OF CARNTTINE 
PALMITOYLTRANSFERASE CARNITINE 
PALMITOYLTRANSFERASE I, LIVER; 
CPTIA 


CTSC 


GDB:642234 


KERATOSIS PALMOPLANTARIS WITH 
PERIODONTOPATinA KERATOSIS 
PALMOPLANTARIS WITH 

ONYCHOORYPOSIS CATHEPSIN C; CTSC 


DDBl 


GDB:595014 


DNA DAMAGE-BESIDING PROTEIN; DDBl 


DDB2 


GDB:595015 


DNA DAMAGE-BINDING PROTElN-2; DDB2 
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Gene 


GDB Accession ED 


OMIMLiiJc ^ 


DHCR7 


GDB:9835302 


SMITH-LEMLI-OPrrZ SYNDROME 


DLAT 


GDB:1 18785 


CIRRBOSIS, PRIMARY; PBC 


DRD4 


GDB:127782 


DOPAMINE RECEPTOR D4; DRD4 


ECB2 


GDB:9958955 


POLYCYTHEMIA, BENIGN FAMILIAL 


ED4 


GDB:9837373 


DYSPLASL^, MARGARITA TYPE 


EVRl 


GDB:134029 


EXUDATIVE VirREORBTINOPATEIY, 
FAMILIAL; EVR 

EXT2GDB:344921EXOSTOSES, MULTIPLE, 

1 iJrXi JJL, s2tJ\,yL ^Jtl\JJNJJKUoAKL/UMA 








FSHB 


GDB:1 19955 


FOLUCLE-STIMULATING HORMONE, 






17PPBTTTM"nm AAA?' njATKr 1. UTIII 




csriQ.'^ 1 s&nn 

VjUJS.XXooUU 


rrlKNlUlUUd AJNJiMlA, L/UJNvjJbJNlTAL, DUE 
TO DEFECT OF INTRINSIC FACTOR 


GSDIB 


GDB:9837619 


GLYCOGEN STORAGE DISEASE lb 


GSDIC 


GDB:9837637 


STORAGE DISEASE Ic 


HBB 


GDB: 119297 


HEMOGLOBIN-BETA LOCUS; HBB 


HBBPl 


GDB: 120035 


HEMOGLOBIN-BETA LOCUS; HBB 


HDD 


GDB:119298 


HEMOGLOBIN-DELTA LOCUS; HBD 


HBEl 


GDB:1 19299 


HEMOGLOBIN-EPSILON LOCUS; HBEl 


HBGl 


GDB:1 19300 


HEMOGLOBIN, GAMMA A; HBGl 


HBG2 


GDB:1 19301 


HEMOGLOBIN, GAMMA G; HBG2 


HMBS 


GDB:120528 


PORPHYRIA, ACUTE INTERMITTENT; AIP 


HND 


GDB:9954478 


HARTNUP DISORDER 


H0MG2 


GDB:9956484. 


MAGNESIUM WASTING, RENAL 


HRAS 


GDB:120684 


BLADDER CANCER V-HA-RAS HARVEY 
RAT SARCOMA VIRAL ONCOGENE 
HOMOLOG; HRAS 


HVBSl 


GDB: 120069 


CANCER, HEPATOCELLULAR 


roDM2 


GDB:128530 


DIABETES MELLITUS, 
INSULIN-DEPENDENT, 2 DIABETES 

A/TRT T TTTTQ TTR/ThXTTT T? rfcXTCTJT 
IVLcJLJLl i U o, J U V JtiJN LbXi-UJN oJi 1 

INSULIN-DEPENDENT; IDDM 


IGER 


GDB:1 19696 


IgE RESPONSIVENESS, ATOPIC; IGER 


INS 


GDB:119349 


INSULIN; INS 


IBS 


GDB: 1201 11 


JACOBSEN SYNDROME; JBS 


KCNJU 


GDB:7009893 


POTASSIUM CHANNEL, 
INWARDLY-RECTIFYING, SUBFAMILY J, 
MEMBER 11; KCNJll PERSISTENT 
HYPERINSULINEMIC HYPOGLYCEMLV OF 
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Gene 


GDB Accession ID 


OMIMLiiik'^' 






INFANCY 


KCNJl 


GDB:204206 


POTASSIUM CHANNEL, 
MEMBER 1; KCNJl 




fTnR'74.1 '744 




LDHA 


GDB: 120141 


LACTATE DEHYDROGENASE-A; LDHA 


LRP5 


GDB:9836818 


OSTEOPOROSIS-PSEUDOGLIOMA 
SYNDROME; OPPG HIGH BONE MASS 


MENl 


GDB: 120173 


MULTIPLE ENDOCRINE NEOPLASIA, TYPE 
1;MEN1 


MIX 


GDB:128819 


MYELOID/LYMPHOID OR 
MDCED-LINEAGE LEUKEMIA; MLL 


MTACRl 


GDB: 125743 


MULTIPLE TUMOR ASSOCIATED. 
CHROMOSOME REGION 1; MTACRl 


MYBPC3 


GDB:579615 


CARDIOMYOPATHY, FAMILIAL 
HYPERTROPHIC, 4; CMH4 
MYOSIN-BINDING PROTEIN C, CARDIAC; 
MYBPC3 


MY07A 


GDB:132543 


MYOSIN VnA; MY07A DEAFNESS, 
NEUROSENSORY, AUTOSOMAL 
RECESSTS^, 2; DENB2 DEAFNESS, 
AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 11; 

niTKT A 1 1 
JJJriN Al i 


NNOl 


GDB:10450513 


SIMPLE, AUTOSOMAL DOMINANT 


OPPG 


GDB:378943& 


OSTEOPOROSIS-PSEUDOGLIOMA 

o X JNJL/K.L'iVlJLi, KJrryj 






RECESSIVE 


IT AA.O 




rPi \ \\ r,}J j5\JJ^ JtlUlVLcUlll^ VXDJNxi 0, JrAAO 


PC 


GDB:119472 


PYRUVATE CARBOXYLASE DEHCIENCY 


PDXl 


GDB:9836634 


PYRUVATE DEHYDROGENASE COMPLEX, 


PfrT 9 
J: VJl^^ 




P AP A r^AXmi TOA/T A Q T7AlV/nTTAT 

NONCHROMAFFIN, 2; PGL2 


PGR 


GDB:1 19493 


PROGESTERONE RESISTANCE 






pnPPTIVPTA PTTRCTRP TVPT7- 'OCXOC^ 


PTH 


GDB:119522 


PARATHYROID HORMONE; PTH 


PTS 


GDB:118856 


6-@PYRUVOYLTETRAHYDROPTERIN 
SYNTHASE; PTS 


PVRLl 


GDB:583951 


ECTODERMAL DYSPLASIA, CLEFT LIP 
AND PALATE, HAND AND FOOT 
DEFORMITY, DYSPLASIA, MARGARITA 



178 



wo 2004/010106 PCT/US2003/023075 



Gene 


l^nTt Aprp«!einii TTI 








TYPE POUOVmUS RECEPTOR RELATED; 
PVRR 

X T xvxx 


PYGM 


GDB:120329 


GLYCCX5EN STORAGE DISEASE V 


RAGl 


GDB: 120334 


RECOMBINATION ACTIVATING GENE-1 ; 
RAGl 


RAG2 


GDB-125186 


J'^J-'V^ vlVXI> JJLN-rV. X XVyI>l ciX^ X 1 V A X XIN VJ VJxlXN JD^Z ^ 

RAG2 


ROMl 


GDB:120350 


ROD OUTER SEGMENT PROTEIN-1; ROMl 


SAAl 


<a)B:120364 


SERUM AMYLOID Al; SAAl 


SCA5 




^PTNrnn^ijp'RPT T ap atayta ^- qpa^ 


SCZD2 


GDB:118874 


DISORDER-2; SCZD2 


SDHD 


GDB:132456 


PARAGANGLIOMAS, FAMILIAL 


SERPINGl 


GDB:1 19041 


ANGIONEUROTIC EDEMA, HEREDITARY; 

TTAMP 

XxrVlNJC/ 


SMPDl 


rrDR- 198144 


TsTTPA/TAXTM PTPT^ "ni^PACT? 
INXIilVlAlNlN-JrlLyJS. lyXoHAoxl 


TCIRGl 


GDB:9956269 


OSTEOPETROSIS, AUTOSOMAL 
RECESSIVE 


TCL2 


GDB:9954468 


LEUKEMIA, ACUTE T-CELL; ATL 


TECTA 


GDB:6837718 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 8; 
DFNA8 DEAFNESS, AUTOSOMAL 

SENSORINEURAL, 12; DFNA12 


TH 


vJXyJj. 1 X j'UXZ^ 


TVPnoiTxrR TTvnprkYVT act?* txt 






TPPTTAT AW 






TT TlVyf np QT TQPPPTTOTT TTV riTTXTO 1 fW . 

TSGlOl 




vJX^iD . 1 ZrUM- / U 


A T TlTKTTClV/r T 


USHIC 


GDB: 132544 


USHER SYNDROME, TYPE IC; USHIC 


VMD2 


GDB:133795 


VnELLIFORM MACULAR DYSTROPHY; 
VMD2 


VRNI 


GDB: 135662 


VITREORETINOPATHY, NEOVASCULAR 
INFLAMMATORY; VRNI 


WTl 


GDB: 120496 


FRASIER SYNDROME WILMS TUMOR; 
WTl 


WT2 


GDB: 11 8886 


MULTff LE TUMOR ASSOCIATED 
CHROMOSOME REGION 1; MTACRl 


ZNF145 


GDB:230064 


PROMYELOCYTIC LEUKEMIA ZINC 
FINGER; PLZF 
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Table 13: Genes, Locations and Genetic Disorders on Chromosome 12 



Gene 


GDB Accession ID 


OMBMLink 


A2M 


GDB: 119639 


ALPHA-2-MACROGLOBULIN; A2M 


AAAS 


GDB:9954498 


GLUCOCORTICOID DEFICIENCY 
AND ACHALASIA 


ACADS 


GDB:118959 


ACYL-CoA DEHYDROGENASE, 
SHORT-CHAIN; ACADS 


ACLS 


GDB: 136346 


ACROCALLOSAL SYISIDROME; ACLS 


ACVRLl 


GDB:230240 


OSLER-RENDU-WEBER SYNDROME 
2; 0RW2 ACnVIN A RECEPTOR, 
TYPE n-UKE KINASE 1; ACVRLl 


ADHR 


GDB:9954488 


VTIAISflN D-RESISTANT RICKETS, 
AUTOSOMAL DOMINANT 


ALDH2 


GDB: 119668 


ALDEHYDE DEHYDROGENASE-2; 
ALDH2 


AMHR2 


GDB:696210 


ANTI-MULLERIAN HORMONE TYPE 
n RECEPTOR; AMHR2 


AOM 


GDB:118998 


STICKLER SYNDROME, TYPE 1; STLl 


AOP2 


rjma.i^,! OS'S 


AV^UArL>KJJN-Z; A.\lrZ LflAiihlho 

INSIPIDUS, RENAL TYPE DIABETES 
INSff IDUS, RENAL TYPE, 
AUTOSOMAL RECESSIVE 


ATD 


GDB:696353 


ASPHYXIATING THORACIC 
DYSTROPHY; ATD 


ATP2A2 


GDB: 11 971 7 


ATPase, Ca(2+)-TRANSP0RTING, 
SLOW-TWITCH; ATP2A2 
DARffiR-WHTTE DISEASE; DAR 


BDC 


GDB:5584359 


BRACHYDACTYLY, TYPE C; BDC 


CIR 


GDB:1 19729 


COMPLEMENT COMPONENT-Clr, 
DEHCIENCY OF 


CD4 


GDB:1 19767 


T-CELL ANTIGEN T4/LEU3; CD4 


CDK4 


GDB:204022 


CYCLESr-DEPENDENT KINASE 4; 
CDK4 


CNAl 


GDB:252119 


CORNEA PLANA 1; CNAl 


POT 2A1 




o 1 iUJsJLcK a I MDKUJVLb, 1 Yrli 1; bTLl 
COLLAGEN, TYPE H, ALPHA-1 
CHAIN; COL2A1 

ACHONDROGENESIS, TYPE H; ACG2 


CYP27B1 


GDB:9835730 


PSEUDOVITAMIN D DEFICIENCY 
RICKETS; PDDR 


DRPLA 


GDB:270336 


DENTATORUBRAL-PALLIDOLUYSIA 
N ATROPHY; DRPLA 


ENUR2 


GDB:666422 


ENURESIS. NOCTURNAL, 2; ENUR2 
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Gene 


GDB Accession ID 




FEOMl 


GDB:345037 


FBROSK OF EXTEIAOCULAR 
MUSCLES, CONGENITAL; FEOM 


FPF 


GDB:9848880 


PERIODIC FEVER, AUTOSOMAL 
DOMINANT 


GNB3 


GDB:120005 


GUANINE NUCLEOTIDE-BINDING 
PROTEIN, BETA POLYPEPTIDE-3; 


C3NS, 


GDB:120006 


MUCOPOLYSACCHARIDOSIS TYPE 
iijjj 


HAL 


GDB: 120746 


HISTIDINEMIA 


HBPl 


GDB:701889 


BRACHYDACTYLY WITH 
HYPERTENSION 


HMGIC 


GDB:362658 


HIGH MOBILITY GROUP PROTEIN 
loUrUKJvL JtlMQjlU 


HMN2 


GDB:9954508 


MUSCULAR ATROPHY, ADULT 
SrUNAL 


HPD 


GDB:I35978 


TYROSINEMIA, TYPE m 


IGFl 


GDB:120081 


INSULINLIKE GROWTHFACTOR 1; 
IGFl 


KCNAl 


GDB:127903 


POTASSIUM VOLTAGE-GATED 
CHANNEL, SHAKER-RELATED 
SUBFAMILY, MEMBER 


KERA 


GDB:252121 


CORNEA PLANA 2; CNA2 


KRAS2 


GDB:120120 


V-KI-RAS2 KIRSTEN RAT SARCOMA 
2 VIRAL ONCOGENE HOMOLOG; 
KRAS2 


KRTl 


GDB:128198 


KERATIN 1; KRTl 


KRT2A 


GDB:407o40 


ICHTHYOSIS, BULLOUS TYPE 
KERATIN 2A;KRT2A 


KRT3 


GDB:136276 


KERATIN 3; KRT3 


KRT4 


GDB: 120697 


KERATIN 4; KRT4 


KRT5 


GDB:128110 


EPIDERMOLYSIS BULLOSA 
HERPETIFORMIS, DOWLING-MEARA 
TYPE KERATIN 5; KRT5 


KRT6A 


GDB:128111 


KERATIN 6A;KRT6A 


KRT6B 


GDB:128113 


KERATIN 6B;KRT6B 
PACHYONYCHIA CONGENITA, 
JACKSON-LAWLER TYPE 


KRTHB6 


GDB:702078 


MONILETHRIX KERATIN, HAIR 
BASIC (TYPE 11) O 


LDHB 


GDB:120147 


LACTATE DEHYDROGENASE-B; 

LDHB 


LYZ 


GDB: 120160 


AMYLOIDOSIS, FAMILIAL 
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Gene 


GDB Accession D) 








VISCERAL LYSOZYME; LYZ 


MGCT 


GDB:9954504 


TESTICULAR TUMORS 


MPE 


GDB:120191 


MALIGNANT PROLIFERATION OF 


MVK 


GDB:134189 


MEVALONICACIDURIA 


MYL2 


GDB:128829 


MYOSIN, LIGHT CHAIN, 
REGULATORY VENTRICULAR; 

iVi 1 LiZ 


XTCl 
JNol 






OAP 


GDB: 120245 


OSTEOARTHROSIS, PRECOCIOUS; 


PAH 


GDB:119470 


PHENYLKETONURIA; PKUl 


PPKB 


GDB:696352 


PALMOPLANTAR KERATODERMA, 
BOTHNIAN TYPE; PPKB 


PRB3 


GDB: 119513 


PAROTID SALIVARY 


T)VT? 1 

JrAKi 




ZjI^L,L, W iJVjxiK o I JNiJJcCiJIVlii, jLo 

PEROXISOME RECEPTOR 1; PXRl 


"DT G 




An?nA>rnT at niA TTP'O'R'n'pTA'Dv 


RSN 


GDB:139158 


RESTIN;RSN 


SAS 


GDB:128054 


SARCOMA AMPLIFIED SEQUENCE; 
SAS 


SCA2 


GDB:128034 


SPINOCEREBELLAR ATAXIA 2; SCA2 
ATAXIN-2; ATX2 


SCNNIA 


GDB:366596 


SODIUM CHANNEL, 
NONVOLTAGE-GATED, 1; SCNNIA 


SMAL 


GDB:9954506 


SPINAL MUSCULAR ATROPHY, 
CONGENITAL NONPROGRESSIVE, 
OF LOWER LIMBS 


SPPM 


GDB:9954502 


SCAPULOPERONEAL MYOPATHY; 
SPM 


SPSMA 


GDB:9954510 


SCAPULOPERONEAL 
AMYOTROPHY, NEUROGENIC, NEW 
ENGLAND TYPE 


TBX3 


GDB:681969 


ULNAR-MAMMARY SYNDROME; 
UMST-BOX3;TBX3 


TBX5 


GDB:6175917 


HOLT-ORAM SYNDROME; HOS 
T-BOX5;TBX5 


TCFl 


GDB:125297 


TRANSCRIPTION FACTOR 1, 
HEPATIC; TCFl MATURITY-ONSET 
DIABETES OF THE YOUNG, TYPE ffl; 

M0DY3 


TPIl 


GDB:119617 


TRIOSEPHOSPHATE ISOMERASE 1; 
TPIl 
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Gene 


GDB Accession ID 


OMlSlXiiik' ^^u^.^ 


TSC3 


GDB:127930 


SCLEROSIS-3; TSC3 


UUR. 


GDB:594089 


UTEEUNE 


VDR 


GDB:120487 


VITAMIN D-KESESTANT RICKETS 
Wrra END-ORGAN 
UNRESPONSIVENESS TO 
1,25-DIHYDROXYCHOLECALCIFERO 
L VITAMIN D. RECEPTOR; VDR 


VWF 


GDB:1 19125 


VON WILLEBRAND DISEASE; VWD 



Table 14: Genes, Locations and Genetic Disorders on Chromosome 13 



Gene 


GDB Accession ID 


OMIMLink 


ATP7B 


GDB:120494 


WILSON DISEASE; WND 


BRCA2 


GDB:387848 


BREAST CANCER 2, EARLY-ONSET; 
BRCA2 


BRCDl 


GDB:9954522 


BREAST CANCER, DUCTAL, 1; BRCDl 


CLN5 


GDB:230991 


CEROID-LIPOFUSCNOSIS, NEURONAL 5; 
CLN5 


CPB2 


(H)B:129546 


CARBOXYPEPTIDASE B2, PLASMA; CPB2 


ED2 


GDB:9834522 


ECTODERMAL DYSPLASIA, HIDROTIC; 

BED 


EDNRB 


GDB: 129075 


ENDOTHELDSr-B RECEPTOR; EDNRB 
HIRSCHSPRUNG DISEASE-2; HSCR2 


ENURl 


GDB:594516 


ENURESIS, NOCTURNAL, 1; ENURl 


ERCC5 


GDB:120515 


EXCISION-REPAIR, COMPLEMENTING 
DEFECTIVE, IN CHINESE HAMSTER, 5; 

ERCC5 


FIO 


GDB:119890 


X, QUANTITATIVE VAIOATION IN 
FACTOR X DEFICIENCY; FIO 


F7 


GDB:1 19897 


FACTOR Vn DEFICIENCY 


GJB2 


GDB:125247 


GAP JUNCTION PROTEIN, BETA-2, 26 KD; 
GJB2 DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 1; DFNBl 
DEAFNESS. AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 3; 
DFNA3 


GJB6 


GDB:9958357 


ECTODERMAL DYSPLASIA, HIDROTIC; 
HED DEAFNESS, AUTOSOMAL 
DOMINANT NONSYNDROMIC 
SENSORINEURAL, 3; DFNA3 


IPFl 


GDB:448899 


INSULIN PROMOTER FACTOR 1; IPFl 


MBSl 


GDB:128365 


MOEBIUS SYNDROME; MBS 
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Gene 


GDB Accession ID 


OMIMLuik 






PmsirJPTSJTT A T 






nr vPTMRiVyfT A vxiTrYnr* t 

VjIj 1 ^JuNX^iVJULn, JSJ3 1 LI 1 lU, 1 




\JUX>,L 10/ D*v 


HT A'n'nT7P PAMPPP TJTJTTMriTJT ACTOA>rA« 

RBI 


KHUiv 






SCZD7 


GDB:9864734 


DlSORDER-2; SCZD2 


SGCG 


GDB:3763329 


MUSCULAR DYSTROPHY, LIMB GIRDLE, 

TYPE2C;LGMD2C 


SLC10A2 


GDB:677534 


SOLUTE CARRIER FAMILY 10, MEMBER 
2; SLC10A2 


SLC25A15 


GDB:120042 


HYPERORNITHINEMIA-HYPERAMMONE 

A/TTA-WnA/rnPTTPTTT T TMTTPTA QVXrnTJnA/TR 
IVJLirv-XlV^lVlVJI^Xl JVlJl-fX->lJLN UxvLnt o X In 1 JK\ JIVI n. 


STARPl 


GDB:635459 


STEROIDOGENIC ACUTE REGULATORY 
PROTEIN; STAR 


ZNF198 


GDB:6382650 


ZINC FINGER PROTEIN-198; ZNF198 



Table IS: Genes, Locations and Genetic Disorders on Chromosome 14 



Gene 


GDB Accession ID 


OMIMLink 


ACHMl 


GDB:132458 


COLORBLINDNESS, TOTAL 


ARVDl 


GDB:371339 


ARRHYTHMOGENIC RIGHT VENTRICULAR 

DYSPLASIA, FAMILIAL, 1; ARVDl 


CTAAl 


GDB:265299 


CATARACT, ANTERIOR POLAR 1; CTAAl 


DADl 


GDB:407505 


DEFENDER AGAINST CELL DEATH; DADl 


DFNB5 


GDB:636176 


DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 5; DFNB5 


EMLl 


GDB:6328385 


USHER SYNDROME, TYPE lA; USHl A 


GALC 


GDB: 119970 


KRABBE DISEASE 


GCHl 


GDB:1 18798 


DYSTONIA, PROGRESSIVE, WITH 
DIURNAL VARIATION GTP 
CYCLOHYDROLASE I DEFICIENCY GTP 
CYCLOHYDROLASE I; GCHl 


HEl 


GDB:9957680 


MALFORMATIONS, MULTff LE, WITH LIMB 
ABNORMALITIES AND HYPOPITUITARISM 


IBGCl 


GDB:10450404 


CEREBRAL CALCIFICATION, 
NONARTERIOSCLEROTIC 


IGH@ 


GDB:118731 


IgA CONSTANT HEAVY CHAIN 1; IGHAl 
IMMUNOGLOBULIN: D (DIVERSITY) 
REGION OF HEAVY CHAIN IgA CONSTANT 
HEAVY CHAIN 2; IGHA2 
IMMUNOGLOBULIN: J (JOINING) LOQ OF 
HEAVY CHAIN; IGHJ IMMUNOGLOBULIN: 
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Gene 


GDB Accession ID 


OMIMLint " 'u^-uo^. u^^o^-i*- 






HEAVY Mu CHAIN; Mul; IGHMl 
IMMUNOGLOBULIN: VARIABLE REGION 
OF HEAVY CHAESrS-Hvl; IGHV IgG HEAVY 
CHAIN LOCUS; IGHGl IMMUNOGLOBULIN 
Gm-2; IGHG2 IMMUNOGLOBULIN Gm-3; 
IGHG3 IMMUNOGLOBULIN Gm-4; IGHG4 
IMMUNOGLOBULIN: HEAVY DELTA 
CHAIN; IGHD IMMUNOGLOBULIN: HEAVY 
EPSILON CHAIN; IGHE 


IGHC group 


GDB:9992632 


IgA CONSTANT HEAVY CHAIN 1; IGHAl 
IgA CONSTANT HEAVY CHAIN 2; IGHA2 
IMMUNOGLOBULIN: HEAVY Mu CHAIN; 
Mul; IGHMl IgG HEAVY CHAIN LOCUS; 
IGHGl IMMUNOGLOBULIN Gm-2; IGHG2 
IMMUNOGLOBULIN Gm-3; IGHG3 
IMMUNOGLOBULIN Gm-4; IGHG4 
IMMUNOGLOBULIN: HEAVY DELTA 
CHAIN; IGHD IMMUNOGLOBULIN: HEAVY 

"CDOTT r\XT /"HIT A TKT- T/^TT17 

criSiLUJN CHAUN; XCrUE 




LrUxS.iZUUoD 


Igvj rUbSAV Y. UxiAXN LrUL/Uo; IvirlGl 


IVJXlM 


vjUx5.iZUUoD 


iMMUJNULrJAJiDULrlJN: JlxiAV Y Mu CHAIN; 
Mul; IGHMl 


XUxllv 




nt / A 1 \ c wrr^'D rwjro 
vjL(/\1 j o X JNl^KUMii 


IV 


VjLll5:lJy2/4 


IJNVxiKJSUo VloCJlKUM 






T A THrXTr TTi AXTC'd^'D"N>rTMi^ /^T> /^T17T»TT 

LAiliJNI IKANbrUJbuNllNla CjrilUWrH 
FACTOR-BETA BINDING PROTEIN 2; LTBP2 


MCOP 


GDB:9954527 


MICROPHTHALMOS 


MID 


GDB:1 18840 


MACHADO-JOSEPH DISEASE; MJD 


MNGl 


GDB:6540062 


GOITER, MULTINODULAR 1; MNGl 


MPDl 


GDB:230271 


MYOPATHY, LATE DISTAL HEREDITARY 




kjUd 04 joz 


MUUUJt'OLrYbACCJdAKiDUolo TYPE IIIC 


MixlO 




MYOSIN, HEAVY POLYPEPTIDE 6; MYH6 


MYH7 


GDB:120215 


MYOSIN, CARDIAC, HEAVY CHAIN, BETA; 

MYH7 


MP 


GDB:120239 


NUCLEOSIDE PHOSPHORYLASE; NP 


PABPNl 


GDB:567135 


OCULOPHARYNGEAL MUSCULAR 
DYSTROPHY; OPMD OCULOPHARYNGEAL 
MUSCULAR DYSTROPHY, AUTOSOMAL 
RECESSIVE POLYADENYLATE-BINDING 

r JKU I liUN -2; r Ai3r 2 






AT ZTTPTTVyTP'R TITWAQT? TTATV/TTT TAT TVPP 

AD3 


PYGL 


GDB:120328 


GLYCOGEN STORAGE DISEASE VI 


RPGMPl 


GDB:1 1498766 


AMAUROSIS CONGENITA OF LEBER I 
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Gene 


GDB AccessioalD 


OMIM Link 


QT7'PPrM'A 1 




PP OTP A QP TMUTOTTOP 1 • PT 


OIjJVriiNAO 




A T PT4" A - 1 - A"MTTPtTVTV>rOTP VPQTKT- A A r*T 


SERPINA6 


GDB: 127865 


CORTICOSTEROID-BINDING GLOBULIN; 


SLC7A7 


GDB:9863033 


DIBASICAMINOACIDUKIA H 


SPG3A 


GDB:230126 


SPASTIC PARAPLEGIA-3, AUTOSOMAL 
DOMINANT; SPG3A 


SPTB 


GDB:1 19602 


ELLPTOCYTOSIS, RHESUS-UNLINKED 
TYPE HEREDITARY HEMOLYTIC 

oxJiL^lKJiN, JtSJZ/iA, liKi ijlKUCx iiU, or ir> 


TCLIA 


GDB:250785 


T-CELL LYMPHOMA OR LEUKEMIA 


TCRAV17S1 


GDB:642130 


T-CELL ANTIGEN RECEPTOR, ALPHA 
SUBUNIT; TCRA 


TCRAV5S1 


GDB:451966 


T-CELL ANTIGEN RECEPTOR, ALPHA 
SUBUNIT; TCRA 


TGMl 


GDB:125299 


TRANSGLUTAMINASE 1; TGMl 
ICHTHYOSIS CONGENITA 


TTTFl 


GDB:132588. 


THYROID TRANSCRIPTION FACTOR 1; 
TITFl 


IMIP 


GDB:9954523 


AND ULNA, DUPLICATION OF, WITH 
A'DQPxrr'p r\n ttota axtt* p atmtto 




VJUXS.1ZWU4 


T r^XJl T AXPTT/T.PXT PP/^'DPTOP ATPXTA 

1 -v>JiJ-rI/ AJN i iVjJtSJN Kns^jir 1 UJv, Al^rrlA 

SUBUNIT; TCRA 


TSHR 


GDB:125313 


THYROTROPIN. UNRESPONSIVENESS TO 


USHIA 


GDB:118885 


USHER SYNDROME, TYPE lA; USHIA 


VP 


GDB:120492 


PORPHYRIA VARIEGATA 



Table 16: Genes, Locations and Genetic Disorders on Chromosome 15 



Gene 


GDB Accession ED 


OMIM Link 


ACCPN 


GDB:5457725 


CORPUS CALLOSUM, AGENESIS OF, 
Wrra NEURONOP ATHY 


AH02 


GDB:9954535 


HEREDITARY OSTEODYSTROPHY-2; 
AH02 


ANCR 


GDB:119678 


ANGELMAN SYNDROME 


B2M 


GDB:119028 


BETA-2-MICROGLOBULIN; B2M 


BBS4 


GDB:511199 


BARDET-BIEDL SYNDROME, TYPE 4; 
BBS4 


BIM 


GDB:135698 


BLOOM SYNDROME; BLM 


CAPN3 


GDB:1 19751 


CALPAIN, LARGE POLYPEPTIDE L3; 
CAPN3 MUSCULAR DYSTROPHY, 
LIMB-GIRDLE, TYPE 2; LGMD2 
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Gene 


GDB Accession ID 




CDANl 


GDB:9823267 


DYSERYTHROPOIETIC ANEMIA, 

CONGENITAL, TYPE I 


CDANS 


GDB:386192 


DYSERYTHROPOIETIC ANEMIA, 
CONGENITAL, TYPE HI; CDAN3 


CLN6 


GDB:4073043 


CEROID-LIPOFUSCINOSIS, NEURONAL 6, 
LATE INFANTILE, VARIANT; CLN6 


CMH3 


GDB: 138299 


CARDIOMYOPATHY, FAMILIAL 
HYPERTROPHIC, 3; CMH3. 


CYP19 


GDB:1 19830 


CYTOCHROME P450, SUBFAMILY XK; 
CYP19 


CYPlAl 


GDB:120604 


CYTOCHROME P450, SUBFAMILY I, 

rUL^XrnrlUJiL 1; UYJrlAl 


CYP1A2 


GDB:1 18780 


CYTOCHROME P450, SUBFAMILY I, 
rULt I Jr JtSr 1 lUis z; U Ylr 1 A2 




vjjJxS.ijyi /yo 


Five T "DVT A QTmr^Tnir* i.nwi 


JtiirJo4Z 




JlJiKhiLilI AKY rLbMULY IIC FROTEIN 4,2, 
ERYTHROCYTIC; EPB42 


ETFA 


GDB:119121 


GLUTARIC ACIDURIA HA; GA IIA 






Ji y Ji CUi^UK-i , 11 Y ULi 


FAH 


GDB:119901 


TYROSINEMIA, TYPE I 


FBNl 


GDB:127115 


FIBRILLIN-1; FBNl MARFAN 
SYNDROME; MFS 


FES 


(3)B:1 19906 


V-FES FELINE SARCOMA VIRAI7V-FPS 
FUJINAMI AVIAN SARCOMA VIRAL 

UJNL/UCjJbJNJi 




/^r\lD . 1 1 0^2 f\/Z 


CUKONAViRUS 229E SUSCEPTIBILITY; 
CVS 


TXPY A 

i±cAA 


LxUJd:12UU4U 


TAY-SACHS DISEASE; TSD 






TO 7 A T "DTIT/^ A /Trr\T31W JTT A . T\ 7* A 

ioUVALERiCACIDEMIA^ IV A 






CnULESTASlS-LYMPHEDEMA 
SYNDROME 


UPC 


GDB:1 19366 


LIPASE, HEPATIC; LIPC 


MY05A 


GDB:218824 


MYOSIN VA;MY05A 


OCA2 


GDB:136820 


ALBINISM n 




nT»T} • QC/;r\/i '7'3 
CjJJrs .yooU4 / J 


U 1 UoCLfEKUbio 


PWCR 


GDB: 120325 


PRADER-WILLI SYNDROME 


RLBPl 


GDB:127341 


RETINAIJ^EHYDE-BINDING PROTEIN 1,; 


SLC12A1 


(H)B:386121 


SOLUTE CARRIER FAMILY 12, MEMBER 
1; SLC12A1 


SPG6 


GDB:5 11201 


SPASTIC PARAPLEGIA 6, AUTOSOMAL 
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Gene 


GDB Accession D> 


OMIMLUr" ^-J.^^^ 






DOMINANT; SPG6 


TPMl 


GDB:127875 


TROPOMYOSIN 1; TPMl 


UBE3A 


GDB:228487 


ANGELMAN SYNDROME 
UBIQUmN-PROTEIN UGASE E3>^ 

UBE3A 


WMS 


GDB:5583902 


WEEX-MARCHESANI SYNDROME 



Table 17: Genes, Locations and Genetic Disorders on Chromosome 16 



Gene 


GDB Accession ID 


OMIMLink 


ABCC6 


GDB:9315106 


PSEUDOXANTHOMA ELASTICUM, 
AUTOSOMAL DOMINANT; PXE 
Ji'DiiUDUAAN IxiUMA biJ\i>TlCuM, 
AUTOSOMAL RECESSIVE; PXE 


A T T\C\ A 




ALDOLASE A, FRUCTOSE-BISPHOSPHATE; 
ALDOA 


APRT 


GDB:1 19003 


ADENINE 

PHOSPHORIBOSYLTRANSFERASE; APRT 


A1P2A1 


GDB: 119716 


ATPase, Ca(2+)-TRANSP0RTING, 
FAST-TWITCH 1; ATP2A1 BRODY 
MYOPATHY 


BBS2 


GDB:229992 


BARDET-BIEDL SYNDROME, TYPE 2; BBS2 


CARD15 


GDB: 11026232 


SYNOVITIS, GRANULOMATOUS, WITH 
UVEITIS AND CRANIAL NEUROPATHIES 
REGIONAL ENTERITIS 


CATM 


GDB:701219 


MICROPHTHALMIA-CATARACT 


CDHl 


GDB: 120484 


CADHERIISri;CDHl 


CETP 


GDB: 119773 


CHOLESTERYL ESTER TRANSFER 
PROTEIN, PLASMA; CETP 


CHST6 


GDB:131407 


CORNEAL DYSTROPHY, MACULAR TYPE 


CLN3 


GDB: 120593 


CEROID-LPOFUSCINOSIS, NEURONAL 3, 
JUVENILE; CLN3 


CREBBP 


GDB:437159 


RUBINSTEIN SYNDROME CREB-BINDING 
PROTEIN; CREBBP 


CTH 


GDBrl 19086 


CYSTATHIONINURIA 


CTM 


GDBrl 19819 


CATARACT, ZONULAR 


CYBA 


GDB:125238 


GRANULOMATOUS DISEASE, CHRONIC, 
AUTOSOMAL CYTOCHROME-b-NEGATIVE 

FORM 


CYLD 


GDB:701216 


EPITHELIOMA, HEREDITARY MULTff LE 
BENIGN CYSTIC 


DHS 


GDB:9958268 


XEROCYTOSIS, HEREDITARY 
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Gene 


GDB Accession ID 


OMIMLuik 


DNASEl 


GDB:132846 


DEOXYRIBOISIUCLEASE I; DNASEl 


DPEPl 


GDB:128059 


RENAL DIPEPITDASE 


ERCC4 


GDB:119113 


EXCISION-REPAIR, COMPLEMENTING 

DEFECTIVE, IN CHINESE HAMSTER, 4; 
ERCC4 XERODERMA PIGMENTOSUM, 

l-/l^l\lrJLJaiyulJNlAllLrJ^ xjtsXJUr X*; Arr 






TTA'NTr'OXrT AXTPAyTTA r*01V>rDT T3A>rDTVTT A TTr^XT 
r AIN \^\JJN1 AJNxiJVllA, UUlVLr JLlilXliijN 1 A i lUJN 

GROUP A; FACA 






iViUUUJrUL/i oAL.UilAJt<JJJUo 1 irii IVA 


VJAJN 




XTT7T rD /^"D A TTUV /TIT A "KIT" A V/^xT A T . A XT 

JNiiUKUJrAlxlY, ijrJLAJN i AAUNAL; CjAN 


XXrWJJCl 


wLio . 1 1 y^yj, 


rl I JjKUA Y AC I Lr vjLU I Ai xUUJNli 
HYDROLASE; HAGH 


TTR A 1 




TTT?A/r/^<^T r^'DTXT AT "DtJ A T r\r*TTC 1 . TXD A 1 
xlJiML/01^iJJt51JN--AJUrxlA LfUUUo-1; HKAl 


TTR AO 


LriJiJ . i 1 yzy^ 


TJ^P'\/[^\^^T ^mutm at ■dxj a t nr^iiQ o. tjtd a o 
JtlJQMULrlj\Jx5 iJN — AJUrxlA i^UU U o -2; H KAz 


TTDTJT? 
XlDXUX 


riT\n 'QQKA^A 1 
\JUl> 5rD*frD4 1 


i±bMUlji^Ui51iN Jti-KiiLAi JiD MJbjN 1 AL 
RETARDATION 


HBQl 


GDB:120036 


HEMOGLOBIN-THETA-1 LOCUS; HBQl 


HBZ 


GDB:119302 


HEMOGLOBIN-ZETA LOCUS; HBZ 






JtlJclMy Vj1-»L^ i5ilN'--^Ji 1 A LrUU U o; JblU^ 


FTP 


\jiJo,i lyom- 


TTAPTnriT ^^TITXT* XTD 
xlAJr 1 UVJiAJisUN, xir 


XlOJLI 1 i J>Z 




r^r\'D'T'iQr\T ii "dtjta 'i7"mn#^'D'nT\TT/^'T'Ao*c 
i^UKXloUi^ ll-liiilA-JsJcJiUKilDUClAoBL 

DEFICIENCY 


IL4R 


GDB:1 18823 


INTERLEUKIN-4 RECEPTOR; IL4R 


T TPn 

JLJUl Jl5 




T TD A CD "D T VOf^C/^AvT AT A /^TTV. T TT>*D 

JjJUrAbJl rJ, IjI aUoUMAJL ACID; J_,lrJD 


MCIR 


GDB:135162 


MELANOCORTIN-1 RECEPTOR; MCIR 


MEFV 


GDB:125263 


MEDITERRANEAN FEVER, FAMILIAL; 
MEFV 


MHC2TA 


GDB:6268475 


MHC CLASS n TRANSACTIVATOR; 

MHC2TA 


MLYCD 


GDB:11500940 


MALONYL CoA DECARBOXYLASE 
DEFICIENCY 


PHKB 


GDB:120286 


PHOSPHORYLASE KINASE, BETA 
SUBUNTF; PHKB 


PHKG2 


GDB:140316 


PHOSPHORYLASE KINASE, TESTIS/UVER, 

GAMMA2;PHKG2 


PKDl 


GDB:120293 


POLYCYSTIC KIDNEYS POLYCYSHC 
KIDNEY DISEASE 1 ; PKDl 


PKDTS 


GDB:9954545 


POLYCYSTIC KIDNEY DISEASE, 
INFANTILE SEVERE, WITH TUBEROUS 
SCLEROSIS; 


PMM2 


GDB:438697 


CARBOHYDRATE-DEFICIENT 
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Gene 


GDB Accession ID 


OMIMLink 






GLYCOPROTEIN SYNDROME, TYPE I; 
CDGl PHOSPHOMANNOMUTASE 2; PMM2 


PXE 


GDB:6053895 


PSEUDOXANTHOMA ELASTICXJM, 
AUTOSOMAL DOMrNANT; PXE 
PSEUDOXANTHOMA ELASTICUM. 
AU lU^UMAL JvJtSL^xSoMVji;. irAJi 


SALLl 


GDB:4216161 


TOWNES-BROCKS SYNDROME; TBS 

CAT T Ten? 1 . C A T T 1 

ISAL-LJLlSJb 1; bAJjJLl 


SCA4 


GDB:250364 


SPDSfOCEREBELLAR ATAXIA 4; SCA4 


SCNNIB 


GDB:434471 


SODIUM CHANNEL, NONVOLTAGE-GATED 

1 BETA; bCNMlB 


SCNNIG 


GDB:5o8759. 


SODIUM CHANNEL, NONVOLTAGE-GATED 
1 GAMM^ SCNNIG 


TAT 


GDB: 120398 


TYROSINE TRANSAMINASE DEFICIENCY 








VDI 


GDB: 119629 


DEFECTIVE INTERFERING PARTICLE 
INDUCTION, CONTROL OF 


WT3 


GDB:9958957 


WILMS TUMOR, TYPE HI; WT3 



Table 18: Genes, Locations and Genetic Disorders on Chromosome 17 



Gene 


GDB Accession ED 


OMTMLink 


ABR 


GDB: 119642 


ACTIVE BCR-RELATED GENE; ABR 


ACACA 


GDB: 120534 


ACETYL-CoA CARBOXYLASE 
DEnCIENCY 


ACADVL 


GDB: 1248185 


ACYL-CoA DEHYDROGENASE, 
VERY-LONG-CHAIN, DEFICIENCY OF 


ACE 


GDB: 119840 


DIPEPTIDYL CARBOXYPEPTIDASE-1; 
DCPl 


ALDH3A2 


GDB: 1316855 


SJOGREN-LARSSON SYNDROME; SLS 


APOH 


GDB:1 18887 


APOLIPOPROTEIN H; APOH 


ASPA 


GDB:231014 


SPONGY DEGENERATION OF CENTRAL 

NERVOUS SYSTEM 


Axnsf2 


GDB:9864782 


CANCER OF COLON 


BCL5 


GDB:125178 


LEUKEML\/LYMPHOMA, CHRONIC 
B-CELL, 5; BCL5 


BHD 


GDB: 11498904 


WITH TEaCHODISCOMAS AND 
ACROCHORDONS 


BLMH 


GDB:3801467 


BLEOMYCIN HYDROLASE 


BRCAl 


GDB:126611 


BREAST CANCER, TYPE 1; BRCAl 


CACD 


GDB:5885801 


CHOROIDAL DYSTROPHY, CENTRAL 
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AREOLAR; CACD 


CCAl 


GDB: 118763 


CATARACT, CONGEMTAL, CERULEAN 
TYPE 1; CCAl 


CCZS 


GDB:681973 


CATARACT, CONGENITAL ZONULAR, 
WITH SUTURAL OPACITIES; CCZS 


CHKNBl 


GDB; 120587 


CHOLINERGIC RECEPTOR, NICOTINIC, 
BETA POLYPEPTIDE 1; CHRNBl 


CHKNE 


GDB:132246 


CHOLINERGIC RECEPTOR, NICOTINIC, 

EPSILON POLYPEPTIDE; CHRNE 


CMTIA 


GDBrl 19785 


CHARCOT-MARIE-TOOTH DISEASE, 
TYPE lA; CMTl A NEUROPATHY, 
HEREDITARY, WITH LIABILITY TO 
PRESSURE PALSIES; HNPP 


COLlAl 


GDB:1 19061 


COLLAGEN, TYPE I, ALPHA-1 CHAIN; 
COLlAl OSTEOGENESIS IMPERFECTA 
TYPE I OSTEOGENESIS IMPERFECTA 

TVPT7 A/« r\fA 








CTNS 


GDB:700761 


CYSTINOSIS, EARLY-ONSET OR 
INFANTILE NEPHROP ATHIC TYPE 


EPX 


GDB:377700 


EOSINOPHIL PEROXIDASE; EPX 


ERBB2 


GDB:120613 


V-ERB-B2 AVIAN ERYTHROBLASTIC 

T "RTTK'IhIV/IT A "V/TDAT OXir^/^r^TTXTC? 
J-fJCfUiSJilVlLfV ViK/\-L UiNUUvJXiJNJCf 

HOMOLOG2;ERBB2 


G6PC 


(a)B:231927 


GLYCOGEN STORAGE DISEASE I; GSD-I 


GAA 


GDBrl 19965 


GLYCOGEN STORAGE DISEASE H 




VjrJL/JD . 1 i j'ZH'O 


n\l A r^TOVTM ACT? l^'nxrT/^TT7KT/'''''V' 






^J-fUUAVjnJJN KliUiir i UK; LrL/V_rK 


GFAP 


rTn'R'i 1 R700 


IT-T TAT T7TRTJTT T AT>V A r^TTlTO DT> r\TrT3'TXT. 

GFAP ALEXANDER DISEASE 


GHl 


GDB: 119982 


GROWTH HORMONE 1; GHl 


GH2 


GDB: 119983 


GROWTH HORMONE 2; GH2 


GPIBA 


GDB: 11 8806 


GL^T PLATELET SYNDROME 


GPSC 


GDB:9954564 


FAMILIAL PROGRESSIVE SUBCORTICAL 


GUCY2D 


GDB:136012 


AMAUROSIS CONGENITA OF LEBER I 
GUANYLATE CYCLASE 2D, MEMBRANE; 
GUC2D CONE-ROD DYSTROPHY-6; 


ITGA2B 


GDB:120012 


THROMBASTHENIA OF GLANZMANN 
ANDNAEGELI 


ITGB3 


GDB:120013 


INTEGRIN, BETA-3; ITGB3 


ITGB4 


GDB:128028 


INTEGRIN, BETA-4; ITGB4 
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Gene 


GDB Accession ID 


OMIMLink 


KRTIO 


GDB:118828 


KERATIN 10; KRTIO 


KRT12 


GDB:5583953 


CORNEAL DYSTROPHY, JUVENILE 
EPriHELIAL, OF MEESMANN BCERATDSf 

1 O' "CT'DTI O 
iZ, JSJvi iZ 


T7"T>T'1 O 

KKl 13 


LrUiJ.iZU/4U 


JvcKAlUN IJ, JsJvl 13 


KKl 14 




isJiKAiUN 14, JsJ<.i 14 CjLUI ArHlONE 
SYNTEJETASE; GSS 


KRT14L1 


GDB:120121 


KERATIN 14; KRT14 


KRT14L2 


GDB:120122 


KERATIN 14; KRT14 


KRT14U 


GDB:120123 


KERATIN 14; KRT14 


KRT16 


GDB:136207 


KERATIN 16; KRT16 


KRTloLl 


uDB:120125 


TT'Ull A '1*1 KT 1 iC. TmT^'t H 

KERATIN 16; KRTlo 


KRT16L2 


GDB:120126 


KERATIN 16; KRT16 


KRT17 


GDB:136211 


KERATIN 17; KRTl 7 PACHYONYCHIA 
CONGENITA, JACKSON-LAWLER TYPE 


KRT9 


GDB:303970 


HYPERKERATOSIS, LOCALIZED 
EPIDERMOLYTIC 


MAPT 


GDB: 119434 


MICROTUBULE-ASSOCIATED PROTEIN 
TAU; MAPT PALLIDOPONTONIGRAL 
DEGENERATION; PPND 
DISlNHIBiriON-DEMENTIA-PARKINSONI 
SM-AMYOTROPHY COMPLEX; DDPAC 


MDB 


GDB:9958959 


MEDULLOBLASTOMA; MDB 


MDCR 


GDB:120525 


MILLER-DIEKER LISSENCEPHALY 
SYNDROME; MDLS 
PLATELET-ACTIVATING FACTOR 
ACETYLHYDROLASE, GAMMA SUBUNIT 


MGI 


GDB:9954550 


MYASTHENIA GRAVIS, FAMILIAL 
INFANTILE; FIMG 


MHS2 


GDB: 132580 


MALIGNANT HYPERTHERMIA 
SUSCEPTIBILITY-2; MHS2 


MKSl 


GDB:681967 


MECKEL SYNDROME; MKS 


MPU 


GDB: 120192 


MYELOPEROXIDASE DEFICIENCY 


TlyTT TT 


GDB:o3o050 


MULIBREY NANISM; MUL 


MY015A 


GDB:9838006 


DEAFNESS, NEUROSENSORY, 
AUTOSOMAL RECESSIVE, 3; DrNB3 


XT A T T 


GDB:63o533 


MUCOPOLYSACCHARIDOSIS TYPE HLB 


NAPB 


GDB:9954572 


NEURITIS WITH BRACHIAL 


NFl 


GDB:120231 


NEUROFIBROMATOSIS, TYPE t NFl 


NMEl 


GDB:127965 


NON-METASTATIC CELLS 1, PROTEIN 
EXPRESSED IN; NMEl 
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Gene 


GDB Accession ID 


OMIMi:inr - 


P4HB 


GDB:120708 


PROLYL-4-HYDROXYLASE, BETA 
POLYPEPTIDE; PHDB; PROHB 


PAFAHIBI 


GDB:677430 


MILLER-DIEKER LISSENCEPHALY 
SYNDROME; MDLS 
PLATELET-ACTIVATING FACTOR 
ACETYLHYDROLASE, GAMMA SUBUNTT 


PECAMl 


GDB:696372 


PLATELET-ENDOTHELIAL CELL 
AUiUloHJJN MULcLULii; rMCAMl 


PEX12 


GDB:6155804 


ZELLWEGER SYNDROME; ZS 
PEROXIN-12;PEX12 


PHB 


GDB:126600 


PROHIBITIN; PHB 


PMP22 


GDB:134190 


CHARCOT-MARIE-TOOTH DISEASE 
TYPE lA; CMTIA HYPERTROPHIC 
NEUROPATEIY OF DEJERINE-SOTTAS 
PERIPHERAL MYELIN PROTEIN 22; 
PMP22 


PRKARIA 


GDB: 120313 


MYXOMA. SPOTTY PIGMENTATION 
AND ENDOCRINE OVERACTIVITY 
PROTEIN KINASE, cAMP-DEPENDENT, 
REGULATORY, TYPE I, ALPHA; 

"DTD "f A "D 1 A 






"DO /^TTTTXT T/TVT A OH? AT "DUT A . TH^TT"/^ A 


■D"D V A 

rxClvWJNJS.4 




i'biiUDUllYrUALDUiSlBivUNloM TYPE II, 
LOCUS B;PHA2B 


PRP8 


GDB:9957697 


RETINITIS PIGMENTOSA-13; RP13 


rKrro 




KblUNliib rloMliJNlUoA-I3; KP13 


r ILAH 


LrDi5:yy 57342 


A T>T A OT A /^T> TTTTT^/^TiT A O T A 

APLASIA OR HYPOPLASIA 


RARA 


GDB:120337 


RETDSrOIC ACID RECEPTOR, ALPHA; 

T> A "O A 

RARA 


KUVl 


f^T\ty .IOC Ann 

u1JJd:135477 


RECOVERIN; RCVl 


"DXiTO A 1 

KMdAl 


oiJxj:304519 


REGULATOR OF MITOTIC SPINDLE 
ASSEMBLY 1; RMSAl 


RP17 


GDB:683199 


RETINmS PIGMENTOSA-17; RP17 


RSS 


GDB:439249 


RUSSELL-SILVER SYNDROME; RSS 


SCN4A 


GDB:125181 


PERIODIC PARALYSIS H 


SEKPINF2 


GDB:120301 


PLASMIN INHIBirOR DEnCIENCY 






A T\TJ A T TKT. A T^T 

ADHALIN; AuL 


SGSH 


GDB:13 19101 


MUCOPOLYSACCHARIDOSIS TYPE lEA 


SHBG 


GDB: 125280 


SEX HORMONE BINDING GLOBULIN; 
SHBG 


SLC2A4 


GDB:119997 


SOLUTE CARRIER FAMILY 2, MEMBER 

4; SLC2A4 
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Gene 


GDB Accession ID 


OMEMLink 


SLC4A1 


GDB: 119874 


SOLUTE CAREUER FAMILY 4, ANION 
EXCHANGER, MEMBER 1; SLC4A1 
BLOOD GROUP-DIEGO SYSTEM; DI 
BLOOD GROUP-WRIGHT ANTIGEN; Wr 
ELLIPTOCYTOSIS, RHESUS-UNLINKED 

1 I JrJc HXiKJBjJiiAKl HK.MIH.Y 11^ 




ljJDJt5:i34/lJ 


C/^T TTTTi ATJUTDD UATkiTTr V /I AyTDA^TDms 

dUL/U 1 U UAJKKlcK rAMULY 0, MxlMisliR 
4; SLC6A4 


SMCR 


GDB:120379 


SMITH-MAGENIS SYNDROME; SMS 


SOST 


GDB: 10450629 


SCLEROSTEOSIS 


S0X9 


GDB:134730 


DYSPLASIA 


SSTR2 


GDB:13418o 


SOMAlObTATINRECEPTOR-2; SSTR2 


SYMl 


GDB:512174 


SYMPHALANGISM, PROXIMAL; SYMl 


SYNSl. 


GDB:9862343 


SYNOSTOSES, MULTIPLE, WITH 
BRACHYDACTYLY 


TCF2 


GDB: 125298 


TRANSCRIPTION FACTOR-2, HEPATIC; 

TCF2 


THRA 


GDB:120730 


THYROID HORMONE RECEPTOR, ALPHA 
1;THRA 


TIMP2 


GDB: 132612 


nSSUE INHIBirOR OF 
METALLOPROTEINASE-2; TIMP2 


TOC 


GDB:451978 


TYLOSIS WITH ESOPHAGEAL CANCER; 
TOC 


TOP2A 


GDB:1 18884 


TOPOISOMERASE (DNA) II, ALPHA; 
T0P2A 


TP53 


GDB:120445 


CANCER, HEPATOCELLULAR 
LI-FRAUMENI SYNDROME; LFS TUMOR 
PROTEIN p53; TP53 CARCINOMA 


VBCH 


GDB:9954554 


HYPEROSTOSIS CORTICALIS 
GENERALISATA 



Table 19: Genes, Locations and Genetic Disorders on Chromosome 18 



Gene 


GDB Accession ID 


OMIMLinIc 


ATP8B1 


GDB:453352 


CHOLESTASIS, PROGRESSIVE FAMILIAL 
INTllAHEPATIC 1; PFICl INTRAHEPATIC 
CHOLESTASIS FAMILLVL INTRAHEPATIC 
CHOLESTASIS-l;nCl 


BCL2 


GDB:119031 


B-CELL CLL/LYMPHOMA 2; BCL2 


CNSN 


GDB:9954580 


CARNOSINEMIA 


CORDl 


GDB:118773 


CONE-ROD DYSTROPHY-1; CORDl 


CYB5 


GDB:125236 


MEIHEMOGLOBINEMIA DUE TO 
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Gene 


GDB Accession ID 


UMIM Link 






DEnCffiNCY OF CYTOCHROME b5 


DCC 


GDB:1 19838 


DELETED IN COLORECTAL CARCINOMA; 


F5F8D 


GDB:6919858 


FACTOR V AND FACTOR Vm, COMBINED 
DiirlCiciNUY Ur; r5roD 


FECH 


GDB: 127282 


rKOTOPORPHYRL^, ERYTHROPOIETIC 


FEO 


GDB:4378120 


POLYOSTOTIC OSTEOLYTIC DYSPLASIA, 
HEREDirARY EXPANSILE; HEPOD 


LAMA3 


GDB:251818 


LAMININ, ALPHA 3; LAMAS 


LCFS2 


GDB:9954578 


CANCER 


MADH4 


GDB:4642788 


POLYPOSIS, JUVENILE INTESTINAL 
MOTHERS AGAINST DECAPENTAPLEGIC, 
DROSOPHILA, HOMOLOG OF, 4; MADH4 


MAFDl 


GDB:120163 


MANIC-DEPRESSIVE PSYCHOSIS, 
AUTOSOMAL 


MC2R 


GDB:1351o3 


ADRENAL UNRESPONSIVENESS TO ACTH 


MCL 


GDB:9954574 


LEIOMYOMATA, HEREDITARY MULTIPLE, 
OF SKIN 


MYP2 


GDB:9862232 


MYOPIA 


NPCl 


GDB:138178 


NIEMANN-PICK DISEASE, TYPE CI; NPCl 


SPPK 


GDB:o06444 


PALMOPLANTARIS STRIATA 


TQrBRE 




IKAJNorUKJVLLNCj CjKUWlxlrAUlOK, BxSlA 
1 RESPONSE ELEMENT 


TGIF 


GDB:9787150 


HOLOPROSENCEPHALY, TYPE 4; HPE4 


TTR 


GDB:1 19471 


TRANSTHYRETIN; TTR 



Table 20: Genes, Locations and Genetic Disorders on Chronaosome 19 



Gene 


GDB Accession ID 


OMIMLink 


AD2 


GDB:118748 


ALZHEIMER DISEASE-2; AD2 


AMH 


GDB:118996 


PERSISTENT MULLERIAN DUCT 
SYNDROME, TYPES I AND E; PMDS 
ANTI-MULLERIAN HORMONE; AMH 


AP0C2 


GDB: 119689 


APOLff OPROTEIN C-H DEHCIENCY, 
TYPE I HYPERLIPOPROTEINEMIA DUE 
TO 


APOE 


GDB:1 19691 


APOLIPOPROTEIN B; APOE 


ATHS 


GDB:128803 


LP OPROTEIN PHENOTYPE; ALP 


BAX 


GDB:228082 


BCL2-ASS0CL\TED X PROTEIN; BAX 


BCKDHA 


GDB:1 19723 


MAPLE SYRUP URINE DISEASE 
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Gene 


GDB Accession ID 




BCL3 


GDB: 120561 


B-CELL LEUKEMIA/LYMPHOMA-3; 


BFIC 


GDB:9954584 


BENIGN FAMILIAL INFANTILE 
CONVULSIONS 


C3 


GDB:119044 


COMPLEMENT COMPONENT-3; C3 


CACNAIA 


GDB: 126432 


ATAXIA, PERIODIC 

VESTIBULOCEREBELLAR HEMIPLEGIC 
MIGRAINE, FAMILIAL; MHP 
SPINOCEREBELLAR ATAXIA 6; SCA6 
CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, P/Q TYPE, 
ALPHA lA SUBUMT; CACNAIA 


ceo 


GDB:1 19755 


CENTRAL CORE DISEASE OF MUSCLE 


CEACAM5 


GDB:1 19054 


CARCINOEMBRYONIC ANTIGEN; CEA 


COMP 


GDB:344263 


EPIPHYSEAL DYSPLASIA, MULTIPLE; 
MED PSEUDOACHONDROPLASTIC 
DYSPLASIA CARTILAGE OUGOMERIC 
MATRIX PROTEIN; COMP 


CSX 


GDB:333932 


CONE-ROD DYSTROPHY-2; CORD2 
AMAUROSIS CONGENITA OF LEBER I 
CONE-ROD HOMEO BOX-CONTAINING 


DBA 


GDB:9600353 


ANEMIA, CONGENTIAL HYPOPLASTIC, 
OF BLACKFAN AND DIAMOND 


DDU 


GDB:10796026 


URTICARIA; DDU 


DFNA4 


GDB:606540 


DEAFNESS, AUTOSOMAL DOMINANT 
NONSYNDROMIC SENSORINEURAL, 4; 

DFNA4 


DLL3. 


GDB:9959026 


VERTEBRAL ANOMALIES 


DMPK 


Ca)B:l 19097 


DYSTROPHIA MYOTONICA; DM 


DMWD 


CT>B:7178354 


DYSTROPHIA MYOTONICA; DM 


DPDl 


GDB:1079ol70 


engelmajnn disease 




(jDJB.liyiUl 


ECHO 11 SENSrriVITY; El IS 


ELA2 


GDB:1 18792 


ELASTASE-2; ELA2 NEUTROPENIA, 
CYCLIC 


EPOR 


GDB: 125242 


ERYTHROPOIETIN RECEPTOR; EPOR 


ERCC2 


GDB:119112 


EXCISION-REPAIR, COMPLEMENTING 
DEFECTIVE, IN CHINESE HAMSTER, 2; 
ERCC2 XERODERMA PIOVIENTOSUM 


ETFB 


GDB: 119887 


ELECTRON TRANSFER FLAVOPROTEIN, 
BETA POLYPEPTIDE; ETFB 


EXT3 


GDB:383780 


EXOSTOSES, MULTIPLE, TYPE HI; EXT3 
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Gene 


GDB Accession ID 


OMIM Link 


EYCLl 


GDB:1 19269 


EYE COLOR-1; EYCLl 


FTL 


GDB:1 19234 


FERRTTIN UGHT CHAIN; FTL 


FUTl 


GDB:120618 


FUCOSYLTRANSFERASE-1; FUTl 


T7TTT0 






rUlO 


r'rip.i qci on 

KJUD, xjj loKJ 




VXnJyLl 




rSf T AMTnTTsin A PPT A TP 

METEIYLTRANSFERASE; GAMT 








Ijrrl 


UJJjD.lzUUlD 




vjUoJM 


riTlTl'l 1 O0O1 


Lri-»UUUKUJNJJJAoJti, JVLL/UoJi, MUDlricK 

OF; GUSM 


rLbl 




XDUJNJJi-fXi liKAiNCri 15i^\JL^Js. 


HCLl 


GDB: 119304 


HAIR COLOR-l; HCLl 


HHC2 


GDB:249836 


HYPOCALCIURIC HYPERCALCEMIA, 
FAMILIAL, TYPE E; HHC2 


HHC3 


GDB:9955121 


HYPOCALCIURIC HYPERCALCEMIA, 

T7AA/rTT TAT TVT>P TTT* VTUC^'X 




nFin • 1 '3 o 'J 

KJUd. I DO J, JQ 


TXTTT71>r^lHT T ITT AT? AT^TTRCTOXT 

MOLECULE-3; ICAM3 


INSR 


GDB:1 19352 


INSULIN RECEPTOR; INSR 


JAJvi 


KjUd.d /04oU 


TAXTTTC ■p'TKTAC'P 1 TAV^ 


KLK3 


GDB: 119695 


ANTIGEN, PROSTATE-SPECFIC; APS 


LDLR 


GDB:1 19362 


HYPERCHOLESimOLEMIA, FAMILIAL; 


LHB 


GDB:1 19364 


LUTEINIZING HORMONE, BETA 

VDT?T>TTr\T7« T tTD 

JrUJLr X Jrii£^ 1 UJxi, JLJclx5 






T Xn A CT7 T T\XT A A TT> r^T7T>T7XTr^T7XTT« T Tdl 


L0H19CR1 


GDB:9837482 


ANEMIA, CONGENITAL HYPOPLASTIC, 

r^ij BT An?"irA'M' Avrrv'TiTAAyr/^xm 
Ur JdJLA.I^JsJ' AJN ArilJ ULAMKJrijJ 


T VT 1 


um5:izuiDo 


T UnVTSAifTA T "VT^/TOUnm 1 • T VT 1 

Liii U JvxiiVLlA, i^xMrJtlUlL', i; JLi JLi 


MAN2B1 


GDB:1 19376 


MANNOSIDOSIS, ALPHA B, 
LYSOSOMAL 


MCOLNl 


0)8:10013974 


MUCOLIProOSIS IV 


MDRV 


GDB:6306714 


MUSCULAR DYSTROPHY, AUTOSOMAL 
DOMINANT, WITH RIMMED 
VACUOLES; MDRV 


MLLTl 


GDB:136791 


MYELOID/LYMPHOID OR MIXED 
LINEAGE LEUKEMIA, TRANSLOCATED 
TO, 1; MLLTl 


N0TCH3 


GDB:361163 


DEMENTIA, HEREDITARY 
MULTI-INFARCT TYPE NOTCH, 
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Gene 


GDB Accession n> 


OMIMLink 






DROSOPHILA, HOMOLOG OF, 3; 


NPHSl 


GDB:342105 


NEPHROSIS 1, CONGENITAL, FINNISH 

'T'"\7t:>'c- xtdxtci 
1 Xrh, JNrxlol 


(JFC3 


vjrJJU.lZoUOU 




OPA3 


<^TM5 . QO < /I Q 

ujj J5 : yy J 4 J y u 


UPilC AiKUJrJli, UNrAJNliLiS, WIIjI 

CHOREA AND SPASTIC PARAPLEGIA 


PEPD 


GDB:120273 


PEPTIDASE D; PEPD 


PRPF31 


GDB:333911 


RETINlTllS riCjJVlbM 1 OSA 1 1 ; RPl 1 


PRTN3 


GDB:126876 


PROTEINASE 3; PRTN3; PR3 


PRX 


GDB:11501256 


HYPERTROPHIC NEUROPATHY OF 
DEJERINE-SOTTAS 


PSGl 


GDB: 120321 


PREGNANCY-SPECMC 
BETA-l-GLYCOPROTEIN 1; PSGl 


PVR 


GDB: 120324 


POLIOVmUS SUSCEPTIBILITY, OR 

sENsrrivrrY; pvs 


RYRl 


GDB:120359 


CENTRAL CORE DISEASE OF MUSCLE 
HYPERTHERMIA OF ANESTHESL^. 
KYANUDUNil KUL-Jlr 1(JK-1; KYKl 


SLC5A5 


GDB:5892184 


SOLUTE CARRIER FAMILY 5, MEMBER 
5; SLC5A5 


SLC7A9 


GDB:9958852 


CYSTINURIA, TYPE HI; CSNU3 


STXll 


GDB:9732383 


PEUTZ-JEGHERS SYNDROME 
SERINE/THREONINE PROTEIN KINASE 
11;STK11 


TBXA2R 


GDB:127517 


THROMBOXANE A2 RECEPTOR, 
PLATELET; TBXA2R 


TGFBl 


GDB:120729 


ENGELMANN DISEASE 
mANSFORMING GROWTH FACTOR, 
BETA-1; TGFBl 


TNNB 


GDB:125309 


TROPONIN I, CARDIAC; TNNB 


TYROBP 


GDB:9954457 


POLYCYSTIC LIPOMEMBRANOUS 
OSTEODYSPLASIA WITH SCLEROSING 
LEUKOENCEPHALOPATHY 



Table 2 1 : Genes, Locations and Genetic Disorders on Chromosome 20 



Gene 


GDB Accession ID 


OMIMLink 


ADA 


GDBrl 19649 


ADENOSINE DEAMINASE; ADA 


AHCY 


GDB:1 18983 


S-ADENOSYLHOMOCYSTEINE 
HYDROLASE; AHCY 


AVP 


GDB: 119009 


DIABETES INSIPIDUS, 
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Gene 


GDB Accession ID 


OMIMLmk 






NEUROHYPOPHYSEAL TYPE ARGININE 
VASOPRESSIN; AVP 


CDAN2 


GDB:9823270 


DYSERYTHROPOIETIC ANEMIA, 
CONGENITAL, TYPE H 


CDMPl 


GDB:438940 


CHONDRODYSPLASIA, GREBE TYPE 
CARTILAGE-DERIVED MORPHOGENETIC 
PROTEIN 1 


CHEDl 


GDB:3837719 


CORNEAL DYSTROPHY, CONGENITAL 
ENDOTHELIAL; CHED 


CHRNA4 


GDB:128169 


CHOLINERGIC RECEPTOR, NELIRONAL 
NICOTINIC, ALPHA POLYPEPTIDE 4; 
CHRNA4 EPILEPSY, BENIGN NEONATAL; 
BBNl 


CST3 


GDB:119817 


AMYLOIDOSIS VI 


EDN3. 


GDB:1 19862 


ENDOTHELIN-3; EDN3 
WAARDENBURG-SHAH SYNDROME 


EEGVl 


GDB: 127525 


ELECTROENCEPHALOGRAM, 
LOW-VOLTAGE 


FTLLl 


GDB:1 19235 


FERRITIN LIGHT CHAIN; FTL 


GNAS 


GDB:120628 


GUANINE NUCLEOTIDE-BINDING 
PROTEIN, ALPHA-STIMULATING 
POLYPEPTIDE; 


GSS 


GDB:637022 


GLUTATHIONE SYNTHETASE 
DEHCIENCY OF ERYTHROCYTES, 
HEMOLYTIC ANEMIA 
PYROGLUTAMICACIDTIRIA 
HNF4AGDB:393281DIABETES MELLITUS, 
AUTOSOMAL DOMINANT 
TEIANSCRIPTION FACTOR 14, HEPATIC 
NUCLEAR FACTOR; TCF14 


JAGl 


GDB:6175920 


CHOLESTASIS WITH PERIPHERAL 
PULMONARY STENOSIS JAGGED 1; JAGl 


KCNQ2 


GDB:9787229 


EPILEPSY, BENIGN NEONATAL; EBNl 
POTASSIUM CHANNEL, 
V0LTAGE-GATH3, SUBFAMILY Q, 
MEMBER 2 


MKKS 


GDB:9860197 


HYDROMETROCOLPOS SYNDROME 


NBIAl 


GDB:4252819 


HALLERVORDEN-SPATZ DISEASE 


PCKl 


GDB: 125349 


PHOSPHOENOLPYRUVATE 
CARBOXYKINASE 1, SOLUBLE; PCKl 








PPGB 


GDB: 119507 


NEURAMINIDASE DEHCIENCY WITH 
BETA-GALACTOSIDASE DEFICIENCY 


PPMD 


GDB:702144 


CORNEAL DYSTROPHY, HEREDITARY 
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trene 


\yMJi3 /vccession jlu 


1 ^ 1^ " ^ ^ ^ 'Tui bgu u'- ic:;^ ^gu lui — i 
IJlYXJUyX IjIIIK 










GDB* 120720 


GERSTMANN-STRAUSSLER DTSF ASE- 
GSD PRION PROTEIN; PRM> 


THBD 


GDB:119613 


THROMBOMODULIN; IHBD 


TOPI 


GDB:120444 


TOPOISOMERASE (DNA) I; TOPI 



Table 22: Genes, Locations and Genetic Disorders on Qiromosome 21 



Gene 


GDB Accession DD 


OMIMLink 


AIRE 


GDB:567198 


AUTOIMMUNE 

POLYENDOCRBSfOPATHY-CANDIDIA 
SIS-ECTODERMAL DYSTROPHY; 


APP 




ALZHEnVTER DISEASE- AD AMYT OTD 
BETA A4 PRECURSOR PROTEIN; APP 


CBS 


GDB:119754 


HOMOCYSTINURIA 


C0L6A1 


GDB:119065 


COLLAGEN, TYPE VI, ALPHA-1 
CHAIN; C0L6A1 MYOPATHY, 
BENIGN CONGENITAL, WITH 


C0L6A2 


GDB:1 19793 


COLLAGEN, TYPE VI, ALPHA-2 
CHAIN; COL6A2 MYOPATHY, 
RFNTGN rONGPNTTAT WITH 

CONTRACTURES 






A/TVOPT ONTTS EPTT EPSY OF 

UNVEKRICHT AND LUNDBORG 

CYSTATINB; CSTB 


DCR 


GDB: 125354 


TRISOMY 21 


DSCRl 


GDB:731000 


TRISOMY 21 


EPDMM 


GDB:9954610 


CORE-BINDING FACTOR, RUNT 
DOMAIN, ALPHA SUBUNTT 2; CBFA2 
PLATELET DISORDER, FAMILIAL, 
WITH ASSOCIATED MYELOID 
MALIGNANCY 


HLCS 


GDB:392648 


MULTff LE CARBOXYLASE 
DEFICIENCY, BIOTIN-RESPONSIVE; 
MOD 


HPEl 


GDB:136065 


HOLOPROSENCEPHALY, FAME^IAL 
ALOBAR 


ITGB2 


GDB:120574 


INTEGRIN BETA-2; ITGB2 


KCNEl 


GDB:127909 


POTASSIUM VOLTAGE-GATED 
CHANNEL, ISK-RELATED 
SUBFAMILY, MEMBER 1; 
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Gene 


GDB Accession ID 


OMfiVfY.^nlc'' """^ '' ^ — 


KNO 


GDB:4073044 


KNOBLOCH SYNDROME; KNO 


PRSS7 


GDB:384083 


ENTEROKINASE DEFICffiNCY 


RUNXl 


GDB:128313 


CORE-BINDING FACTOR, RUNT 
DOMAIN, ALPHA SUBUNTT 2; CBFA2 
PLATELET DISORDER, FAMILIAL, 
WITH ASSOCIATED MYELOID 
MALIGNANCY 


SODl 


GDB:119596 


AMYOTROPHIC LATERAL 
SCLEROSIS SUPEROXIDE 
DISMUTASE-1; SODl MUSCULAR 
ATROPHY, PROGRESSIVE, WITH 
AMYOTROPHIC LATERAL 
SCLEROSIS 


TAM 


GDB:9958709 


MYELOPROLIFERATIVE 
SYNDROME, TRANSIENT 



Table 23: Genes, Locations and Genetic Disorders on Chromosome 22 



Gene 


GDB Accession ID 


OMIM Link 


ADSL 


GDB:119655 


ADENYLOSUCCINATE LYASE; ADSL 


ARSA 


GDB'l 19007 


A/TFT A PTTR DM A TTr 

LEUKODYSTROPHY, 

LATE-INFANTILE 


BCR 


GDB:120562 


BREAKPOINT CLUSTER REGION; BCR 


CECR 


GDB:119772 


CAT EYE SYNDROME; CES 


CHEEC2 


GDB:9958730 


LI-FRAUMENI SYNDROME; LFS 

OSTEOGENIC SARCOMA 


COMT 


GDB:119795 


CATECHOL-O-METHYLTRANSFERASE 
;COMT 


CRYBB2 


GDB:119075 


CRYSTALLIN, BETA B2; CRYBB2 
CATARACT, CONGENITAL, 

CERULEAN TYPE, 2; CCA2 


CSF2RB 


GDB:126838 


GRANULOCYTE-MACROPHAGE 
COLONY-STIMULATING FACTOR 
RECEPTOR, BETA SUBUNTT; 


CTHM 


GDB:439247 


HEART MALFORMATIONS; CTHM 


CYP2D6 


GDB:132127 


CYTOCHROME P450, SUBFAMILY UD; 

CYP2D 


CYP2D@ 


GDB:119832 


CYTOCHROME P450, SUBFAMILY UD; 
CYP2D 


DGCR 


GDB: 119843 


DIGEORGE SYNDROME; DGS 


DIAl 


GDB:119848 


METHEMOGLOBINEMIA DUE TO 
DEHCIENCY OF METHEMOGLOBIN 
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Gene 


GDB Accession ID 


OMIMLink - ■ 






REDUCTASE 


EWSRl 


GDB:135984 


EWDSTG SARCOMA; EWS 


GGTl 


GDB:120623 


GLUTATHIONURIA 


MGCR. 


GDB:120180 


MENINGIOMA; MGM 


MNl 


GDB:580528 


MENINGIOMA; MGM 


NAGA 


GDB:1 19445 


ALPHA-GALACTOSIDASEB; GALB 


NF2 


GDB:120232 


NEUROFIBROMATOSIS, TYPE H; NF2 


0GS2 


GDB:9954619 


HYPERTELORISM WITH 
ESOPHAGEAL ABNORMALITY AND 
HYPOSPADIAS 


PDGFB 


GDB:120709 


V-SIS PLATELET-DERIVED GROWTH 
FACTOR BETA POLYPEPTIDE; PDGFB 


PPARA 


GDB:202877 


PEROXISOME PROLIFERATOR 
ACTIVATED RECEPTOR, ALPHA; 

PPARA 


PRODH 


GDB:52151o8 


HYPERPROLINEMIA, TYPE I 


SC02 


GDB:99585o8 


CYTOCHROME c OXIDASE 
DEFICIENCY 


SCZD4 


GDB: 1387047 


SCHIZOPHRENIA DISORDER-^; SCZD4 


SERPINDl 


GDB: 120038 


HEPARIN COFACTOR 11; HCF2 


SLC5A1 


GDB: 120375 


SOLUTE CARRIER FAMILY 5, 
MEMBER 1; SLC5A1 


SOXIO 


/^T\"r> .000 >IAOO 

GDB:9834028 


SRY-BOX 10; SOXIO 




/^TM5 • 1 1 O/CAO 

CjrUrJ.l lyoUJS 


IKAJNbUUiJALAMiN llL/JtlrlUibJNUY 


TIMP3 


GDB:138175 


TISSUE INHIBirOR OF 
METALLOPROTEINASE-3; TIMP3 


VCF 


GDB: 136422 


VELOCARDIOFACIAL SYNDROME 



Table 24: Genes, Locations and Genetic Disorders on Chromosome X 



Gene 


GDB Accession ID 


OMIMLink 


ABCDl 


GDB:118991 


ADRENOLEUKODYSTROPHY; ALD 


ACTLl 


GDB:119648 


ACnN-LKE SBQUENCE-1; ACTLl 


ADFN 


GDB:118977 


ALBINISM-DEAFNESS SYNDROME; ADFN; 

ALDS 


AGMX2 


GDB:1 19661 


AGAMMAGLOBULINEMIA, X-LINKED, 
TYPE 2; AGMX2; XLA2 


AHDS 


GDB:125899 


MENTAL RETARDATION, X-LINKED, 
Wrra HYPOTONIA 


AIC 


GDB:118986 


CORPUS CALLOSUM, AGENESIS OF, WITH 
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Gene 


GDB Accession ID 


OMIMLmk 






pwnp Tm? "RTTxr at a msim? a/t a t ttv 




imij.i 1 Q/i/n 
vjUd.iivooj 


AT "RTXTTQTV/r nPTTT AP TVPT7 0- nA9 
J\ wi nXXNXijjyx, Uv^Ui^/\Jx, i X rEs 


AIH3 


GDB:131443. 


AMELOGENESIS IMPERFECTA-3, 

JjL I irLWrX-j/Vo X Xl^ X I X^JD, AUX3 


ALAS2 


GDB:1 19666 


ANEMIA, HYPOCHROMIC 


AMCD 


GDB:5584286 


ARTHROGRYPOSIS MULTIPLEX 

L^UJNiaiiJNXX A, XJXoXALf 


AMELX 


GDB:1 19675 


AMELOGENESIS lMPERFECTA-1, 

XT\rD/^T>T A OTT/^ TVDT?» ATtT1 

MYr'UJriAoliC 1 YlrJS, AlJtlX 


ANOPl 


GDB:128454 


CLINICAL; ANOPl 


AR 


GDB:120556 


ANDROGEN INSENSmVlTY SYNDROME; 

A TC A XTTM5 /^/^T7XT "D TJ/^"m>TOT> • A "D 


ARAFl 


GDB:1 19004 


V-RAF MURINE SARCOMA 361 1 VIRAL 
UJNCULrxiJNxi rlUJVHJL>ULr 1, AKArl 


ARSC2 


GDB:119702 


ARYLSULFATASE C, f FORM; ARSC2 


ARSE 


GDB:555743 


CHONDRODYSPLASIA PUNCTATA 1, 
X-LINKED RECESSIVE; CDPXl 


ARTS 


GDB:9954651 


FATAL X-LINKED, WITH DEAFNESS AND 
LiUoo Ur VlolUJN 


ASAT 


GDB:9954649 


SIDEROBLASTIC, AND 
SPINOCEREBELLAR ATAXIA; ASAT 


ASSP5 


GDB: 119019 


CITRULLINEMIA 


ATP7A 


GDB:1 19395 


ATPase, Cu(2+)-TRANSP0RTING, ALPHA 
POLYPEPTIDE; ATP7A MENKES 
SYNDROME 


ATRX 


GDB 136052 


ALPHA-THALASSEMIA/MENTAL 
RETARDATION SYNDROME, X-LINKED; 
ATRX ALPHA-THALASSEMIA/MENTAL 
RblAKDAllUJN oYJnDKUMis, 
NONDELETION TYPE 


AVPR2 


GDB:131475 


DIABETES If^SIPIDUS, NEPHROGENIC 


BFLS 


uDB:1205oo 


BUKJJioUiN biJNJJKUMls; BUKJ 


BGN 


/^TMD .11 mo n 
(jrDrJ: 11^727. 


BlvjrL.YCAJN; BljJN 


BTK 


GDB:120542 


BRUTON AGAMMAGLOBULINEMIA 

TV1D#^OTMtJ VTVrAOT3< BTT?' 


BZX 


GDB:5205912 


BAZEX SYNDROME; BZX 


CIHR 


GDB:1 19040 


TATA BOX BINDING PROTEIN 
(TBP)-ASSOCIATED FACTOR 2A; TAF2A 


CACNAIF 


GDB:6053864 


NIGHTBLINDNESS, CONGENITAL 
STATIONARY, X-LINKED, TYPE 2; CSNB2 
CALCIUM CHANNEL, 
VOLTAGE-DEPENDENT, ALPHA IF 
SUBUMT; CACNAIF 
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Geae 


GDB Accession ID 


OMBfLink 


CALB3 


GDB:133780 


CALBINDIN3;CALB3 


CBBM 


GDB:9958963 


COLORBLINDNESS, 

BLUE-MONO-CONE-MONOCHROMATIC 
TYPE; CBBM 


CCT 


GDB: 119756 


CATARACT, CONGENITAL TOTAL, WITH 
POSTERIOR SUTURAL OPACITIES IN 
HETEROZYGOTES; 


CDRl 


GDB:119053 


CEREBELLAR DEGENERATION-RELATED 
AU 1 UAJN i ICjiijN-l ; CDRl ; CDK34 


CFNS 


GDB:9579470 


CRANIOERONTONASAL SYNDROME; 
CFNS 


CGFl 


GDB:o2758o7 


COGNITION 


CHM 


GDB: 120400 


CHOROIDEREMIA; CHM 


CHR39C 


GDB: 119779 


CHOLESTEROL REPRESSBLE PROTEIN 
39C; CHR39C 


CIDX 


GDB:127736 


SEVERE COMBINED 
IMMUNODEFICIENCY DISEASE, 
X-LINKED, 2; SCIDX2 


CLA2 


GDB:119782 


CEREBELLAR ATAXIA, X-LINKED; CLA2 


CLCN5 


GDB:270667 


CHLORIDE CHANNEL 5- CLCN5 FANCONI 
SYNDROME, RENAL, WITH 
NEPHROCALCINOSIS AND RENAL 
STONES NEPHROLITHIASIS, X-LINKED 
RECESSIVE, WITH RENAL FAILURE; XRN 


CLS 


GDB:119784 


RIBOSOMAL PROTEIN S6 KINASE, 90 KD, 
POLYPEPTIDE 3; RPS6KA3 
COFFIN-LOWRY SYNDROME; CLS 


CMrX2 


GDB:128311 


CHARCOT-MARE-TOOTH NEUROPATHY, 
X-LINKED RECESSIVE, 2; CMTX2 




GDB:128151 


CHARCOT-MARIE-TOOTH NEUROPATHY, 
X-IJNKED RECESSIVE, 3; CMTX3 


CND 


GDB:9954627 


DERMOIDS OF CORNEA; CND 


CODl 


GDB: 11 9787 


CONE DYSTROPHY, X-LINKED, 1; CODl 


C0D2 


GDB:6520166 


CONE DYSTROPHY, X-LINKED, 2; C0D2 


COL4A5 


GDB: 120596 


COLLAGEN, TYPE TV, ALPHA-5 CHAIN; 
COL4A5 LEIOMYOMATOSIS, 
ESOPHAGEAL AND VULVAL, WITH 
NEPHROPATHY 


COL4A6 


GDB:222775 


COLLAGEN, TYPE IV, ALPHA-6 CHAIN; 
COL4A6 LEIOMYOMATOSIS, 
ESOPHAGEAL AND VULVAL, WITH 
NEPHROPATHY 


CPX 


GDB: 120598 


CLEFT PALATE, X-LINKED; CPX 
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Gene 


GDB Accession ID 


OMIMLink 


CVDl 


GDB:9954659 


CARDIAC VALVULAR DYSPLASIA, 


CYBB 


(338:120513 


GRANULOMATOUS DISEASE, CHRONIC; 


DCX 


GDB:9823272 


LISSENCEPHALY, X-LINKED 


DEN2 


GDB;119091 


DEAFNESS, X-LINKED 2, PERCEPTIVE 
CONGENITAL; DFN2 


DFN4 


GDB:433255 


DEAFNESS, X-LINKED 4, CONGENITAL 


DFN6 


GDB:1320698 


DEAFNESS, X-LINKED, 6, PROGRESSIVE; 
DFN6 


DHOF 


GDB:1 19847 


FOCAL DERMAL HYPOPLASIA; DHOF 


DIAPH2 


GpB:9835484 


DIAPHANOUS, DROSOPHILA, HOMOLOG 
OF, 2 DKCIGDB: 119096 DYSKERATOSIS 
CONGENITA; DKC 


DMD 


GDB:119850 


MUSCULAR DYSTROPHY, 
PSEUDOHYPERTROPHIC PROGRESSIVE, 

JJUUxlbJNJNJi AJNJJ x5xlL/JSJiK 


DSS 


GDB:433750 


DOSAGE-SENSrriVE SEX REVERSAL; DSS 


DYT3 


GDB:118789 


TORSION DYSTONIA-3, X-LINKED TYPE; 
DYT3 


EBM 


GDB:119102 


BULLOUS DYSTROPHY, HEREDITARY 
MACULAR TYPE 


EBP 


GDB:125212 


CHONDRODYSPLASL^ PUNCTATA, 
X-LINKED DOMINANT; CDPX2; CDPXD; 
CPXD 


EDI 




ECTODERMAL DYSPLASL^, ANHIDROTIC; 
EDA 


ELKl 


GDB: 119867 


ELKl, MEMBER OF ETS ONCOGENE 
FAMILY; ELKl 


EMD 


GDB:1 19108 


MUSCULAR DYSTROPHY, TARDIVE, 
DREIFUSS-EMERY TYPE, WITH 

/^/^■XT'T'D A r^TT TD TJ C 

V^UJN 1 KAC i UKxio 


EVR2 


GDB: 136000 


HAUDAllVli VllK^^UJXJbllJNUJrAixlY, 
FAMILIAL, X-LINKED RECESSIVE; EVR2 


F8C 


GDB:119124 


HEMOPHILIA A 


F9 


GDB: 119900 


HEMOPHILIA B; HEMB 


FCrl 


CjDJ3:34/4yU 


Tj r^T^T T "DO rw\f ir^^nrxKX v t tmi?'T7t>- T7n> y 
r -UMLrL, x'KUl-i U d iUJN , A-JLJJNJIsJCfL/, r Uir A 




10099 


SYNTRFTASF-^- FPSL5 


FGDl 


GDBrl 19131 


SYNDROME FACIOGENTTAL DYSPLASIA; 

FGDY 


FGSl 


GDB:9836950 


FG SYNDROME 
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Gene 


GDB Accession ID 


OMIMLmk 


FMRl 


GDB:129038 


FRAGE.E SITE MENTAL RETARDATION- 1 ; 

FMRl 


FMR2 


GDB:141566 


FRAGILE SITE, FOLIC ACID TYPE, RARE, 
FRA(X)(q28); FRAXE 


G6PD 


GDB: 120621 


GLUCOSE-6-PHOSPHATE 


GABRA3 


GDB:119968 


GAMMA-AMBSrOBUTYRIC ACID 
RECEPTOR, ALPHA-3; GABRA3 


GATAl 


GDB:125373 


GATA-BINDING PROTEIN 1; GATAl 


GDIl 


GDB:1347097 


GDP DISSOCL\TION INHIBITOR 1; GDIl 
MENTAL RETARDATION, X-LINKED 
NONSPECIFIC, TYPE 3; MRX3 


GDXY 


GDB:9954629 


DYSGENESIS, XY FEMALE TYPE; GDXY 


GJBl 


GDB:125246 


CHARCOT-MARIE-TOOTH PERONEAL 
MUSCULAR ATROPHY, X-LINKED; CMTXl 
UAJr JUJNdiUJN JrKUiiilJN, oiilA-l, 5JL JsXi; 
GJBl 


GK 


(jDi3:liy271 


H 1 1* JlKUJL I L/CKULJiMiA. 


GLA 


GDB: 119272 


ANGIOKERATOMA, DIFFUSE 


GPC3 


GDB:3770726 


GLYPICAN-3; GPC3 SIMPSON 

D Y bMUiCriliA o i JNDKUJVLb; bJD Y JS 


GRPR 


GDB:128035 


GASTRJN-RELEASING PEPTIDE 
KbL/HriUK, (jKrK 


GTD 


CjDB:yy54o35 


uUJNAUUlKUrJUN DUrlCJJiWCY, LrlLl 


GUST 


GDB:9954655 


MENTAL RETARDATION WIIH OPTIC 
ATROPHY, DEAFNESS, AND SEIZURES 


HMSl 


GDB:251827 


1;HMS1 


HPRTl 


GDB:1 19317 


HYPOXANTHINE GUANINE 
PHOSPHORIBOSYLTRANSFERASE 1; 
HPRTl 


HPT 


GDB:1 19322 


HYPOPARATHYROIDISM, X-LINKED; 
HYPX 


HTC2 


GDB:700980 


HYPERTRICHOSIS, CONGENITAL 

/^T7XTCT> A T TTTjTV. /^r^TU. TJ/^iTl 


HTR2C 


GDB:378202 


5-@riYDKUAY IKirlAMilNJl KUL/iUrlUK 

2C; HTR2C 


HYR 


GDB:9954625 


REGULATOR; HYR ' 


IDS 


GDB:120521 


MUCOPOLYSACCHARIDOSIS TYPE H 


fflGl 


GDB: 119343 


HYPOPLASIA OF, WITH GLAUCOMA; IHG 


E.2RG 


GDB:134807 


INTERLEUKIN-2 RECEPTOR, GAMMA; 
IL2RG SEVERE COMBINED 
IMMUNODEFICIENCY DISEASE, 
X-LINKED, 2; SCIDX2 
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Gene 


KrUD Accession ID 


OjVLLVL JLinK 


TXTrkV 

IMJJA 




X-LINKED 




GDB:120105 


INCONTINENTIA PIGMENTI, TYPE I; IPl 


IP2 


GDB:120106 


INCONTINENTIA PIGMENTI, TYPE E; TP2 


JMS 


GDB:204055 


MENTAL RETARDATION, X-LINKED, 
WITH GROWTH RETARDATION, 


KALI 


GDB:120116 


KALLMANN. SYNDROME 1; KALI 


KFSD 


GDB:128174 


KERATOSIS FOLLICULARIS SPINULOSA 
DECALVANS CUM OPHIASI; KFSD 


LICAM 


GDB:120133 


CLASPED THUMB AND MENTAL 
RETARDATION LI CELL ADHESION 
MOLECULE; LICAM 


LAMP2 


GDB:125376 


LYSOSOME-ASSOCIATED MEMBRANE 


MAA 


GDB:1 19372 


MlUKUJrxl 1 JiAi^iVLlA UK AJNUJrrl ixlALMUa, 
WITH ASSOCIATED ANOMALIES; MAA 


MAFD2 


GDB:1 19373 


PSYCHOSIS, X-LINKED 


MAOA 


(jrUi3:12UlD4 




MAOB 


GDB:1 19377 


MONOAMINE OXIDASE B; MAOB 


MCF2 


GDB:120168 


MCF.2 CELL LINE DERIVED 
TRANSFORMING SEQUENCE; MCF2 


MCS 


(a)B: 128370 


MENTAL RETARDATION, X-LINKED, 
SYNDROMIC-4, WITH CONGENITAL 


MEAX 


GDB:1193o3 


V T T\TVT3r» tH/ri'LT U'V/~'T3QCT\/TJ 

A-L/ilNJsxHJ, WllJtl XlAUllisMVxl 

AUTOPHAGY; XMEA; MEAX 


MECP2 


GDB:3851454 


SYNDROME; RTT 


MF4 


GDB:119386 


METACARPAL 4-5 FUSION; MF4 


MGCl 


CT>B:120179 


MEGALOCOKNEA; MGCl; MGCN 


MIC5 


(SDB:120526 


SURFACE ANTIGEN, X-LINKED; SAX 


MIDI 


Ga)B:9772232 


OPrrZ SYNDROME 


MLLT7 


(a>B:392309 


MYELOID/LYMPHOID OR 
MDCED-LINEAGE LEUKEMIA, 
TRANSLOCATED TO, 7; MLLT7 


MLS 


GDB:262123 


MICROPHTHALMIA WIIH LINEAR SKIN 
DEFECTS; MLS 


MRSD 


ODB: 119398 


MENTAL RETARDATION, SKELETAL 
DYSPT A5?TA AND ABDUCENS PALSY* 
MRSD 


MRX14 


GDB:138453 


RETARDATION, X-LINKED 14; MRX14 


MRXl 


GDB:120193 


MENTAL RETARDATION, X-LINKED 
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iarUJS /VGCcSSlOn JLLI 








NONSPECMC, TYPE 1; MRXl 


MRX20 


GDB:217050 


MENTAL RETARDATION, X-LINKED 20; 
MRX20 


MRX2 


GDB:120194 


RETARDATION, X-LINKED NONSPECaFIC, 
TYPE2;MRX2 


MRX3 


GDB: 128105 


GDP DISSOCIATION INHIBITOR 1; GDIl 
MENTAL RETARDATION, X-LINDEOBD 

NONSPECIFIC, TYPE 3; MRX3 


MRX40 


GDB:700754 


MENTAL RETARDATION, X-LINKED, 

\ArrTTT TTVPriTmsJTA 


MKAA 




NONSPECMC, Wira APHASLi^ MRXA 


MoU 


r^T^U'l 1 OQOQ 


o I JN JJJvLIJYLtl 


MlMl 


/TT^"D* 1 1 0AQQ 
ljrJJx5. 1 iy4jy 


IVyrvnTTTOTTT AP IV/fVDP ATWV 1 • AyTTTV/fl 




oJJxs.izuzuy 


MYCL2 


MYPl 


GDB:127783 


MYOPIA, X-LINKED; MYPl, 


JNDr 




XmP P TT? 'HTQT? A QT? • "NTT^P 


NHS 


GDB:120235 


CATARACT-DENTAL SYNDROME 


NPHLl 


GDB:433705 


NEPHROLITHIASIS, X-LINKED 




uJJJtS.iloyoZ 


ATiPTHXTAT trVPnPT AQTA r^OXTrilHTvTTT A T • 

AHC 








X7V01 




XTVCT A riA>fT TC Y T TMYPn* XTVQ 
JN X o X jrVvjlMUo, A-JLXXNJSJiUj IN I o 


NYX 


GDB:119814 


NIGHTBLINDNESS, CONGENITAL 

CT A TJfWX A P V TkTTVTJ A/fVT^PT A • QXT0 1 
ol AilUJNAJvl, WllJtl iVlI VJJrXA, ^oiNi>X 


OAl 


GDB:119459 


ALBESnSM, OCULAR, TYPE 1; OAl 


OASD 


GDB:138457 


OCULAR, WITH LATE-ONSET 






T OITiTT? r>r^TTT f^r^PPPPPOPPXTAT 
J_/UWJl1 Uv^UX-rwVp^XlJNJ&XJXv^JNJliNAX^ 

SYNDROME; OCRL 


ODTl 


GDB:125360 


TEETH, ABSENCE OF 


OFDl 


GDB:120248 


OROFACIODIGITAL SYNDROME 1; OFDl 


UrAZ 


LrUxJ.l/jjJo 


r\DTJr^ A TP PiP WV 0 • DP A '7 


OPDl 


GDB: 120249 


OTOPALATODIGITAL SYNDROME 


OPEM 


GDB: 119467 


OPHTHALMOPLEGIA, EXTERNAL, AND 
MYOPTA* OPFM 

iVX X \JX X.'^y \Jx XiXVX 


OPNILW 


GDB:120724 


COLORBLINDNESS, PARTIAL, PROTAN 
SERIES; CBP 


OPNIMW 


GDB:120622 


COLORBLINDNESS, PARTIAL, DEUTAN 
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Gene 


GDB Accession ID 


OMIMLink 






SERIES; CBD; DCB 


OTC 


GDB:1 19468 


ORNITHINE TRANSCARBAMYLASE 
DEFICIENCY, HYPERANflMONEMIA DUE 

TO* CYVn 


P3 


GDB:yy54oo7 


irK-UIJCfitN STD 


T\i-vTT A i 

PDHAl 


(jDD.llooyj 


El-ALPHA POLYPEPTIDE-1; PDHAl 


PDR 


GDB:203409 


AMYLOIDOSIS, FAMILIAL CUTANEOUS 


PFC 


GDB:120275 


PROPERDIN DEHCIENC Y, X-LINKED 


PEKFBl 


GDB:125375 


6-@PHOSPHOFRUCTO-2-KINASE; PFKEBl 


PGKl 


GDB:120282 


PHOSPHOGLYCERATE KINASE 1; PGKl 


PGKIPI 


GDB:120283 


PHOSPHOGLYCERATE KINASE 1; PGKl 


PGS 


GDB:128372 


DANDY-WALKER MALFORMATION WTTH 
MENTAL RETARDATION, BASAL 
GANGLIA DISEASE, 


PHEX 


GDB: 120520 


HYPOPHOSPHATEMIA, VITAMIN 

"DTJOTCT AXTT "DTOVRTQ* "CTVP 


PHKAl 


GDB: 120285 


'DXJOC'DTJO'DVT ACTJlTTMACn ATPTTA 1 
irrlUor^ilUKl LfAoEf JsJJNAoJi, AJLrxlA 1 

SUBUNTT (MUSCLE); PHKAl 


PHKA2 


GDB: 127279 


/-IT xrr^rM^TTKT CTOTJ A niS TiXQX^ A CT3 \7TTT 


PHP 


GDB:119494 


FANxlYJrUrll Ull AKJLoM, rrir 


PIGA 


GDB:138138 


Tity/^OTfLI A 'T*TT\'VT TKT/^CTTOT m "VT^ A XT 

FHUiSrJ±A.llDYLlNUoliULf Gi-YUAJN, 
CLASS A; PIGA 


PLPl 


GDB: 120302 


PROTEOLIPID PROTEIN, MYELIN; PLP 


POFl 


GDB:120716 


T»TIT?TI A A TT TOT? /^AT" A "DT A TvT "C A TT T TOIiJ 1 • "Di^TTI 

PREMATUKfc. UVARiAJN rAILUKU 1, rUrl 


POLA 


GDB: 120304 


POLYMERASE, DNA, ALPHA; POLA 


POU3F4 


GDB:351386 


DEAFNESS, COISIDUCTIVE, WITH STAPES 
FIXATION 


PPMX 


GDB:9954669 


RETARDATION WITH PSYCHOSIS, 
PYRAMIDAL SIGNS, AND 
MACKOUKCxilUloM 


PRD 


GDB:371323 


DYSPLASIA, PRIMARY 


PRPSl 


GDB:120318 


PHOSPHORIBOSYLPYROPHOSPHATE 
SYNTHETASE-I; PRPSl 


PRPS2 


GDB:120320 


PHOSPHORIBOSYLPYROPHOSPHATE 
SYNTHETASE-H; PRPS2 


PRS 


GDB:128368 


MENTAL RETARDATION, X-LINKED, 
SYNDROMIC-2, WITH DYSMORPHISM 
AND CEREBRAL 


PRTS 


(H)B:128367 


PARTINGTON X-LINKED MENTAL 
RETARDATION SYNDROME; PRTS 
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Gene 


GDB Accession ID 


„^ .11- U.HI' U — ijHiV nmJI lUL .~J!i IL.^ luU .1^ 1 

OMIMLink 


PSr2 


GDB:119519 


TO AXTCT>0'DTT30 O S.\>n» TAPO 

IKANolrUxCliiKz, AbU, lArZ 


RfilSIBP 


(jDB:1337yz 


"DT7KTTNT "DTKmTXm t>P "DThXTRP 


RENSl 


CTDB:yoOoi4o 


RENPENNINGTYPE 


RP2 


GDB:120353 


RETINinS PIGMENTOS A-2; RP2 


RPo 


CjDB:1233o1 


"DTnAvrmvTTOQ A A» 17 PA 
JriVjMllXN 1 Uo A-0, JtvJrO 


RPGR 


GDB:1 18736 


RETINITIS PIGMENTOSA-3; RP3 


RPS4X 


GDB:128115 


RIBOSOMAL PROTEIN S4, X-LINKED; 

T>"DC /I"V 


RPSoKAS 


GDB:365648 


RlBOoUMAL rXUlULIN oo JsJUNAoii, y\) JsJJ, 
POLYPEPTIDE 3; RPS6KA3 


RSI 


GDB:119581 


Tt'CnvTi^Cl/^TJTOTC. "DO 

RETINL/oLJll&lb; K.O 


Sll 


GDB:120361 


ANTIGEN, X-UNKED, SECOND; SAX2 


SDYS 


GDB:119590 


GLYPICAN-3; GPC3 SIMPSON 
DYSMORPHIA SYNDROME; SDYS 


SEDL 


GDB:120372 


SPONDYLOEPIPHYSEAL DYSPLASIA, 
LATE; SEDL 


SERPINA7 


GDB: 120399 


THYROXINE-BINDING GLOBULIN OF. 
SERUM; TBG 


SH2D1A 


GDB:120701 


IMMUNODEFICIENCY, X-LINKED 

PPi^OPTJOCRTT? /"^OAvTDTXrPTl \7'APTA'DT T7 


SHFM2 


GDB:226635 


SPLIT-HAND/SPLrr-FOOT ANOMALY, 
A-LiNKJiD 


SHOX 


GDB:6118451 


SHORT STATURE; SS 


SLC25A5 


GDB:125190 


ADENINE NUCLEOTIDE TRANSLOCATOR 
2; ANT2 


SMAX2 


GDB:9954643 


SPINAL MUSCULAR ATROPHY, X-LINKED 

T T3Tr»TT A T TXTT7 A XTT»TT T? 

LETHAL INFANTILE 


SRPX 


GDB:3811398 


KbiUNlTlb FiGJVmjN lUbA-i; Kri 


SRS 


GDB:13o337 


MEN 1 AL Rb 1 ARDA i lUN, X-LrJJNJsJiJJ, 
SNYDER-ROBINSON TYPE 


STS 


GDB: 120393 


ICHTHYOSIS, X-LINKED 


SYNl 


GDB: 11 9606 


OVTVT A "OCTVT T. CVXTI 

bYJNArbUN 1; biJNl 


SYP 


GDB:125295 


SYNAPTOPHYSIN; SYP 


TAFl 


GDB:120573 


TATA BOX BINDING PROTEIN 

(TBP)- ASSOCIATED FACTOR 2A; TAF2A 


TAZ 


GDB:120609 


CARDIOMYOPATHY, DILATED 3A; 
FIBROELASTOSIS-2; EFE2 


TBX22 


GDB:10796448 


CLEFT PALATE, X-LINKED; CPX 


TDD 


GDB:119610 


MALE PSEUDOHERMAPHRODITISM: 



210 



wo 2004/010106 PCT/US2003/023075 



Gene 


GDBAccessioalD 


OMBMLink"^ "u*^^ 






DEFICffiNCY OF TESTICULAR 
17^0-DESMOLASE; 


TFE3 


GDB:125870 


TRANSCRIPTION FACTOR FOR 
IMMUNOGLOBULIN HEAVY-CHAIN 


xrlAo 






THC 


GDB:125361 


THROMBOCYTOPENIA, X-LINKED; THC; 


TIMM8A 


GDB:119090 


DEAFNESS 1, PROGRESSIVE; DFNl 


TMPl 


GaDB:l 19615 


TISSUE INHIBITOR OF 
METALLOPROTEINASE-1; TIMPl 


TKCR 


GDB:1 19616 


TORTICOLLIS, KELOIDS, 
CRYPTORCHIDISM, AND RENAL 

DYSPLASM; TKC 


TNFSF5 


GDB:120632 


IMMUNODEFICIENCY WITH INCREASED 
IgM 


UBEl 


GDB:118954 


UBIQUTTIN-ACTIVATING ENZYME 1; 


UBE2A 


GDB:131647 


UBIQUTTIN-CONJUGATING ENZYME E2A; 


WAS 


GDB: 120736 


WISKOTT-ALDRICH SYNDROME; WAS 


WSN 


GDB:125864 


PARKINSONISM, EARLY-ONSET, WITH 
MENTAL RETARDATION 


WTS 


GDB:128373 


MENTAL RETARDATION, X-LINKED, 
SYNDROMIC-6, WITH GYNECOMASTIA 


wws 


GDB:120497 


WIEACKER SYNDROME 


XIC 


GDB:120498 


X-INACnVATION-SPECIFIC TRANSCRIPT; 

VTQT 






XIST 






w T nnr tq 


AM 




VTV>f CVCTPX/T 
AlVl o X 0 1 JdJVI 


xs 


GDB:1 19636 


LUTHERAN SUPPRESSOR, X-LINKED; XS; 

T TTYQ 


ZEX 


(aDB:120502 


ZINC FINGER PROTEIN, X-LINKED; ZFX 


ZIC3 


GDB:249141 


HETEROTAXY, X-LINKED VISCERAL; 




GDB '9785766 


MENTAL RETARDATION X-LINKED- 
DXS6673E 


ZNF41 


GDB:125865 


ZESrC FINGER PROTEIN-41; ZNF41 


ZNF6 


GDB:120508 


ZINC FINGER PROTEIN-6; ZNF6 
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Table 25: 


Genes, Locations and Genetic Disorders on Chromosome Y 


Gene 


GDB Accession ID 


OMIMLink 


AMELY 


vjJJi5:iiyo /o 


AAyfFT n(T^PT«JTM Y-CHROMOSOMAT • AMELY 


ASSP6 




nrPTTT T TNTPA/TTA 


AZFl 


GDB:1 19027 


AZOOSPERMIA FACTOR 1; AZFl 


KZFl 


GDB:456131 


AZOOSPERMIA FACTOR 2; AZF2 


DAZ 


GDB:635890 


DELETED IN AZOOSPERMIA^ DAZ 


GCY 


GDB:119267 


CONTROL, Y-CHROMOSOME INFLUENCmD; 
GCY 


RPS4Y 


GDB:128052 


RIBOSOMAL PROTEIN S4, Y-LINKED; RPS4Y 


SMCY 


GDB:5875390 


mSTOCOMPATBILrrY Y ANTIGEN; HY; 
HYA 


SRY 


GDB:125556 


SEX-DETERMINING REGION Y; SRY 


ZFY 


GDB:120503 


ZINC FINGER PROTEIN, Y-DNKED; ZFY 


Table 26: 


Genes, Locations and Genetic Disorders in Unknown or Multiple 

Locations 


Gene 


GDB Accession n) 


OMBVfLink 


ABAT 


GDB:581658 


GAMMA-AMINOBUTYRATE TRANSAMINASE 


AEZ 


GDB:128360 


ACRODERMATITIS ENTEROPATHICA, 
ZINC-DEFICIENCY TYPE; AEZ 


AFA 


GDB:265277 


FILIFORME ADNATUM AND CLEFT PALATE 


AFDl 


GDB:265292 


DYSOSTOSIS, TREACHER COLLINS TYPE, 
WTTHLIMB ANOMALIES 


AGSl 


GDB:10795417 


ENCEPHALOPATHY, FAMILIAL INFANTILE, 
WITH CALCIFICATION OF BASAL GANGLIA 


ASAH 


GDB:6837715 


FARBER LIPOGRANULOMATOSIS 


ASDl 


GDB:6276019 


ATRIAL SEPTAL DEFECT; ASD 


ASMT 


GDB:136259 


CETYLSEROTONIN METHYLTRANSFERASE; 
ASMT ACETYLSEROTONIN 
METHYLTRANSFERASE, Y-CHROMOSOMAL; 
ASMTY;HIOMTY 


Bcaa 


GDB:118758 


CHOREA, HEREDITARY BENIGN; BCH 


CCAT 


GDB:118738 


CATARACT, CONGENITAL OR JUVENILE 


CECR9 


GDB:10796163 


CAT EYE SYNDROME; CES 


CEPA 


GDB:581848 


CONTROL, CONGENITAL FAILURE OF 


CHED2 


GDB:9957389 


CORNEAL DYSTROPHY, CONGENITAL 
HEREDITARY 
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Gene 


GDB Accession ID 


OMBMLink 


CLAl 


GDB:1 19781 


CEREBBLLOPARENCHYMAL DISORDER m 


CLA3 


GDB:128453 


C3EREBELL0PARENCHYMAL DISORDER I; CPD I 


CLN4 


GDB:125229 


CEROID-LIPOFUSCINOSIS, NEURONAL 4; CLN4 


CPO 


GDB:119070 


COPROPORPHYRIA 




GDB118777 


COLONY STIMULATING FACTOR 2 RECEPTOR, 

ALPHA; CSF2RA 
GRANULOCYTE-MACROPHAGE 

ALPHA SUBUNTT, 


CTSl 


GDB:1 18779 


CARPAL TUNNEL SYNDROME; GTS; GTS 1 


DF 


CH)B:132645 


FACTOR D 


DIHl 


GDB :43 9243 


TNT A T»TTT> A K A T'T/^ 

DIAPHRAGMATIC 


DWS 


GDB:128371 


SYNDROME; DWS 


DYT2 


GDB:1 18788 


DYSTONIA MUSCULORUM DEFORMANS 2; 

DYT2 


DYT4 


GDB:433751 


DYSTONIA MUSCULORUM DEFORMANS 4; 
DYT4 


EBR3 


GDB:1 18739 


NEUROTROPHICA 


ECT 


GDB:128640 


CENTRALOPATHIC EPILEPSY 


EEF1A1L14 


GDB:1327185 


PROo 1 Al IC UAKUUN UMA UJNL^UUliJNli 1- 1 


EYCL2 


GDB:464281S 


EYE COLOR-3; EYCL3 


FAl 


GDB:118795 


FANCONI ANEMIA, COMPLEMENTATION 
CjRUUr A; ACA 


FANCB 


GDB:9864269 


FANCONI FANCY 1 UFbJNlA, 1 YrH z 


GCSH 


GDB:126842 


HYPERGLYCINEMIA, ISOLATED NONKETOTIC, 
TYPEin;NKH3 


GCSL 


GDB:132139 


ISOLATED NONKETOTIC, TYrIa IV, JNJsll4 


GDF5 


GDB:433948 


CARTlLAGil-DJiKIVllD MUJKJrnLiurllJNJtlllC 
PROTEIN 1 


GIF 


GDBrl 19985 


GASTRIC INHIBITORY POLYPEPTIDE; GIP 


GTS 


GDB:1 18807 


GILLES DE LA TOURETTE SYNDROME; GTS 


HHG 


GDB: 11 8740 


HYPbRGOXNADUixtUJrlU rli JrUvjUJNAUioJM, xltHj 


HMI 


GDB:265275 


OF rrO; HMI 


HOAC 


GDB:118812 


DEAFNESS, CONGENITAL, AUTOSOMAL 
RECESSIVE 


JtlUJvJfrZ 




TTYPOKAT EMir PERIODIC PARALYSIS. TYPE Hi 
H0KPP2 


HRPTl 


GDB: 125252 


HYPERPARATHYROIDISM, FAMILIAL PRIMARY 


HSD3B3 


GDB:676973 


GIANT CELL HEPATITIS, NEONATAL 
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Gene 


GDB Accession ID 


OMIM Link 


HTCl 


GDB:26528o 


HYPERTRlL/ilUolo UJNlVxiKoALflo UUJNljJliNliA, 
AMBRAS TYPE; HTCl 


HVIS 


/~iTXT^ .r\r\c c r\f\c\ 

GDB:9955009 


HRRPRS VlKUo orlJNolJLlVll Y, rlV io 


ICRl 


GDB:127785 


LAMELLAR, AUTOSOMAL DOMINANT FORM 


ICRS 


GDB:127789 


ICHTHYOSIS CONC^NTTA, HARLEQUIN FETUS 
TYPE 


IL3RA 


GDB:128985 


INTERLEUKIN-3 RECEPTOR, ALPHA; IL3RA 
INTERLEUKJIM-i KiJUilrlUK, 
Y-CHROMOSOMAL; IL3RA 


KAL2 


GDB:265288 


KALLMANN SYNDROME 2; KAL2 


KMS 


GDB:118827 


SYNDROME; KMS 


KRT18 


GDB:120127 


KERATIN 18; KRT18 


KSS 


GDB:9957718 


KEARNS-SAYRE SYNDROME; KSS 


LCAT 


GDB:119359 


FISH-EYE DISEASE; FED 

LECrrHIN:CHOLESTEROL ACYLTRANSFERASE 
DEFICIENCY 


LIMM 


GDB:9958161 


Ik ir\Tr\Tk A T«TTV Ti >rPT«r^/^<TT/^XTr\'D TAT T "CTTT A T 

MYOPATHY, MITOCHONDRIAL, LBlHAlr 
INFANTILE; LBMM 


MANBB 


GDB:125262 


MANNOSIDOSIS, BETA; MANBl 


MCPH2 


GDB:9863035 


MICROCEPHALY; MCT 


MEB 


GDB:599557 


DISEASE 


MELAS 


GDB:9955855 


MELAS SYNDROME 


MIC2 


GDB:120184 


SURFACE ANTIGEN MIC2; MIC2; CD99 MIC2 
SURFACE ANTIGEN, Y-CHROMUISUMAL; MiUz Y 


MPFD 


GDB:439372 


CONGENITAL, WITH FIBER-T YFE 
DISPROPORTION 


MS 


GDB:229116 


SCLEROSIS; MS 


MSS 


GDB:118743 


MARINESCO-SJOGREN SYNDROME; MSS 


MTATP6 


GDB: 11 8897 


ATP SYNTHASE 6; MTATP6 


MTCOl 


GDB:118900 


COMPLEX IV, CYTOCHROME c OXIDASE 
SUBtfNIT I; MTCOl; COI 


MTC03 


GDB- 11 8902 


CYTOCHROME c OXIDASE HI; MTC03 


MTCYB 


GDB' 11 8906 


COMPLEX m, CYTOCHROME b SUBUNIT 


MTNDl 


GDB118911 


COMPLEX I, SUBUNIT NDl; MTNDl 


MTND2 


GDB:118912 


COMPLEX I, SUBUNIT ND2; MTND2 


MTND4 


GDB:1 18914 


COMPLEX I, SUBUNIT ND4; MTND4 


MiJNJJD 




PrnV/TPTPVT QITRTTNTTT MnS- MTND^ 

Vj'iVlJr J-fJZ<j(\. 1, O L/I5 U i>i X X iNXy , IVX X iNX^ J 


MTND6 


GDB:118917 


COMPLEX I, SUBUNIT ND6; MTND6 


MTRNRl 


GDB:1 18920 


RIBOSOMAL RNA, MITOCHONDRIAL, 12S; 
MTRNRl 
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Gene 


GDB Accession ID 


OMIMLink 


MTRNE12 


GDB:118921 


RIBOSOMAL RNA, MITOCHONDRIAL, 16S; 

MTRNR2 


MTTE 


GDB: 118926 


TRANSFER RNA, MTTOCHONDRIAL, GLUTAMIC 
ACID; MTTE 


MTTG 


GDB:118933 


IHANSFERRNA, MITOCHONDRIAL, GLYCINE; 
MTTG 


MTTI 


GDB: 11 8935 


TRANSFER RNA, MITOCHONDRIAL, 
ISOLEUCINE; MTTI 


MTTK 


GDB: 118936 


MERRF SYNDROME TRANSFER RNA, 
MITOCHONDRIAL, LYSINE; MTTK 


MTTLl 


GDB: 118937 


MERRF SYNDROME TRANSFER RNA, 
MITOCHONDRIAL, LEUCINE, 1; MTTLl 


MTTL2 


GDB: 11 8938 


TRANSFER RNA, MITOCHONDRIAL, LEUCINE, 
2;MTTL2 


MTTN 


GDB:118940 


TRANSFER RNA, MITOCHONDRIAL, 
ASPARAGINE; MTTN 


HTTP 


GDB:1 18941 


TRANSFER RNA, MITOCHONDRIAL, PROLINE; 
MTTP 


MTTSl 


GDB:1 18944 


TRANSFER RNA, MITOCHONDRIAL, SERINE, 1 ; 
Ml iol 


NAMSD 


GDB:o81237 


INliUJtv.Ui A i xl 1 , iVLVy 1 \^Jx-ijI1JL\oV^Jv 1,11 Tili 11, 

WITH DEAFNESS AND MENTAL RETARDATION 


NODAL 


GDB:9848762 


NODAL, MOUSE, HOMOLOG OF 


OCDl 


GDB: 11 8846 


DlbUKJJxiK- 1 , UL/JJ 1 


0PD2 


GDB:131394 


bYJNUKUiVLtl/ 


PCK2 


GDB: 137198. 


2, MITOCHONDRIAL; PCK2 


PCLD 


GDB:433949 


POLYCYSTIC LIVER DISEASE; PLD 


PCOSl 


GDB: 1391 802 


GTUTIVT T "D\/T7XmiT A T Q VXTni? OAvTR 


PFKM 


GDB: 120277 


riT VI^OnUXT CTOT? a n-Tn" "T^T^T? A QIh" VTT 

ijj-i 1 UULrJtiJN o 1 L/K/VurU JJIoIl/\ol2r V 11 


PKD3 


GDB: 127866 


PKD3 


PRCAl 


GDB:342066 




PROl 


QDB:l2o5o5 




PROPl 


(H)B:9834318 


PROPHET OF Pm, MOUSE, HOMOLOG OF; 

XiT>C\0\ 
rssxjry. 


RBS 


GDB:lloooz 




PT7YAP 




REGULATORY FACTOR X-ASSOCIATED 
PROTEIN; REXAP 


RP 


GDB:9958158 


RETINITIS PIGMENTOSA-8 


SLC25A6 


GDB:125184 


ADENINE NUCLEOTIDE TRANSLOCATOR 3; 
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Geae 


GDB Accession ID 


OMIM Link 






ANT3 ADENINE NUCLEOTIDE TRANSLOCATOR 


SPG5B 


uDB:250333 


RECESSIVE; SPG5B 


STO 


GDB:439375 


CEREBRAL GIGANTISM 


SUOX 


GDB:5584405 


SULFOCYSTEINURIA 


TC21 


GDB:5573831 


ONCOGENE TC21 


THM 


GDB:439378 


FAMILIAL 


TST 


GDB:134043 


RHODANESE; RDS 


TTD 


GDB:230276 


TRICHOTfflODYSTROPHY; TTD 



Equivalents: 

The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in addition 
to those described will become apparent to those skilled in tiie art firom the foregoing 
description and accompanying figures. Such modifications are intended to fall within the 
scope of the appended claims. 

Various publications are cited herein, the disclosures of which are 
incorporated by reference in their entireties. 
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The invention can be illustrated by the following embodiments enumerated 
in the numbered paragraphs that follow: 

1 . A method for identifying a compound that modulates premature 
translation termination or nonsense-mediated mKNA decay, comprising the steps of (a) 

5 contacting a detectably labeled target RNA molecule with a library of compounds under 
conditions that permit direct binding of the labeled target RNA to a member of the library 
of compounds so that a detectably labeled target KN[A:compound complex is formed; (b) 
separating the detectably labeled target RNArcompoimd complex formed in step (a) from 
uncomplexed target RNA molecules and compounds; and (c) determining a structure of the 

1 0 compound bound to the RNA in the RNAxompound complex. 

2. A method for identifying a compound that modulates premature 
translation termination or nonsense-mediated mRNA decay, comprising the steps of (a) 
contacting a target RNA molecule with a library of detectably labeled compounds under 
conditions that permit direct binding of the target RNA to a member of the library of 

15 labeled compounds so that a target RNAxompound complex that is detectably labeled is 
formed; (b) separating the target RNAxompound complex formed in step (a) from 
uncomplexed target RNA molecules and compounds; and (c) determining a structure of the 
compound bound to the RNA in the RNAxompound complex. 

3. The method of paragraph 1 in which the target RNA molecule 
20 contains regions of 28S rRNA or analogs thereof 

4. The method of paragr^h 1 in which the detectably labeled RNA is 
labeled with a fluorescent dye, phosphorescent dye, ultraviolet dye, infrared dye, visible 
dye, radiolabel, enzyme, spectroscopic colorimetric label, affinity tag, or nanoparticle. 

5. The method of paragraph 1 in which the compound is selected from a 
25 combinatorial library comprising peptoids; random bio-oligomers; diversomers such as 

hydantoins, benzodiazepines and dipeptides; vinylogous polypeptides; nonpeptidal 
peptidomimetics; oligocarbamates; peptidyl phosphonates; peptide nucleic acid libraries; 
antibody libraries; carbohydrate Ubraries; and small organic molecule libraries, including 
but not limited to, libraries of benzodiazq)ines, isoprenoids, thiazolidinones, 
30 metathiazanones, pyrrolidines, morpholino compounds, or diazepindiones. 

6. The method of paragraph 1 in which screening a library of 
compounds comprises contacting tiie compound with the target nucleic acid in the presence 
of an aqueous solution, the aqueous solution comprising a buffer and a combination of salts, 
preferably approximating or mimicking physiologic conditions. 
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7. The method of paragraph 6 in which the aqueous solution optionally 
further comprises non-specific nucleic acids comprising DNA, yeast tRNA, salmon spemi 
DNA, homoribopolymers, and nonspecific RNAs. 

8. The method of paragraph 6 in which the aqueous solution further 
5 comprises a buffer, a combination of salts, and optionally, a detergent or a surfactant In 

another embodiment, the aqueous solution further comprises a combination of salts, &om 
about 0 mM to about 100 mM KCl, firom about 0 mM to about 1 M NaCl, and firom about 0 
mM to about 200 mM MgCfe. In a preferred embodiment, the combination of salts is about 
100 mM KCl, 500 mM NaCl, and 1 0 mM MgCl2. lu another embodiment, the solution 
10 optionally comprises firom about 0.01% to about 0.5% (w/v) of a detergent or a surfactant 

9. Any method that detects an altered physical property of a target 
nucleic acid complexed to a compound from the xmbound target nucleic acid may be used 
for separation of the complexed and non-complexed target nucleic acids in the method of 
paragraph 1. In a preferred embodiment, electrophoresis is used for separation of the 

15 complexed and non-complexed target nucleic acids. In a preferred embodiment, the 
electrophoresis is capillary electrophoresis. In other embodiments, fluorescence 
spectroscopy, surface plasmon resonance, mass spectrometry, scintillation, proximity assay, 
structure-activity relationships ("SAR") by NMR spectroscopy, size exclusion 
chromatography, afiSnity chromatography, and nanoparticle aggregation are used for the 

20 separation of the complexed and non-complexed target nucleic acids. 

10. The structure of the compound of the KNAicompound complex of 
paragraph 1 is determined, in part, by the type of library of compounds. In a preferred 
embodiment wherein the combinatorial Ubraries are small organic molecule Ubraries, mass 
spectroscopy, NMR, or vibration spectroscopy are used to determine the structure of the 

25 compounds. 
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1 . A method for identifying a compound that binds to a target KiSFA, 
said method comprising 

(a) contacting a detectably labeled target RNA molecule with a 
library of compounds under conditions that permit direct 
binding of the labeled target RNA to a member of the library 
of compounds and the formation of a detectably labeled target 
RNA:compound complex, wherein the target RNA is a region 
of 28S rRNA or contains a premature stop codon; and 

(b) detecting the formation of a detectably labeled target 
RNAxompound complex. 

2. A method for identifying a compound to test for its ability to 
modulate premature translation termination or nonsense-mediated mRNA decay, said 
method comprising: 

(a) contacting a detectably labeled target RNA molecule with a 
library of compounds under conditions that permit direct 
binding of the labeled target RNA to. a member of the library 
of compoimds and the formation of a detectably labeled target 
RNA:compound complex, wherein the target RNA is a region 
of 28S rRNA or contains a premature stop codon; and 

(b) detecting a detectably labeled target RNA:compound complex 
formed in SEtep(a), so that if a target RNA:compoimd complex 
is detected then the compound identified is tested for its 
ability to modulate premature translation or nonsense- 
mediated mRNA delay. 

3 . A method for identifying a compound that binds to a target RNA, 
said method comprising detecting the formation of a detectably labeled target 
RNA:compound complex formed from contacting a detectably labeled RNA with a member 
of a library of compounds under conditions that permit direct biading of the labeled target 
RNA to a member of the library of compounds and the formation of a detectably labeled 
target RNA:compound complex, wherein the target RNA is a region of 28S rRNA or 
contains a premature stop codon. 
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4. A method of identilEying a compound that modulates premature 
translation termination or nonsense-mediated mRNA decay, said method comprising: 

(a) contactmg a detectably labeled target RNA molecule with a 
library of compounds under conditions that permit direct 

5 binding of flie labeled target RNA to a member of the library 

of compounds and the formation of a detectably labeled target 
RNArcompound complex, wherein the target RNA is a region 
of 28S rRNA or contains a premature stop codon; and 

(b) detecting a detectably labeled target RNArcompound complex 
10 formed in step(a), so that if a target RNAxompound complex 

is detected, then 

(c) contacting the compound with a cell-free translation mixture 
and a nucleic acid sequence comprising a regulatory element 

. operably linked to a reporter gene, wherein the reporter gene 
IS contains a premature stop codon; and 

(d) detecting the expression of the reporter gene, wherein a 
compound that modulates premature translation termination 
or nonsense-mediated mRNA decay is identified if the 
expression of the reporter gene in the presence of the 

20 compound is altered relative to the expression of the reporter 

gene in the absence of tiie compound or the presence of a 
negative control. 

5. A method of identifying a compound that modulates premature 
translation termination or nonsense-mediated mRNA decay, said method comprising: 

25 (a) contacting a detectably labeled target RNA molecule with a 

library of compoimds under conditions that permit direct 
binding of the labeled target RNA to a member of the library 
of compounds and the formation of a detectably labeled target 
RNArcompound complex, wherein the target RNA is a region 

30 of 28S rRNA or contains a premature stop codon; and 

(b) detecting a detectably labeled target RNArcompound complex 
formed in step(a), so that if a target RNArcompound complex 
is detected, then 
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10 



5 



(C) 



(d) 



contacting the compound with a cell containing a nucleic acid 
sequence comprising a regulatory element operably linked to 
a reporter gene, wherein the reporter gene contains a 
premature stop codon; and 

detecting the expression of the reporter gene, wherein a 
compound that modulates premature translation termination 
or nonsense-mediated mRNA decay is identified if the 
expression of tihe reporter gene in the presence of the 
compound is altered relative to the expression of the reports 
gene in the absence of the compound or the presence of a 
negative control. 



6. 



The method of claim 1, 2, 3, 4 or 5, wherein each compound in the 



library is attached to a solid support. 

7. The method of claim 6, wherein the solid support is a siUca gel, a 
15 resin, a derivatived plastic film, a glass bead, cotton, a plastic bead, a polystyrene bead, an 
aluminum gel, a glass slide or a polysaccharide. 



compounds is attached to a chip. 

9. The method of claim 1, 2, 3, 4 or 5, wherein the detectably labeled 
20 KNA is labeled with a fluorescent dye, phosphorescent dye, ultraviolet dye, infixed dye, 
visible dye, radiolabel, enzyme, spectroscopic colorimetric label, afGnity tag,' or 
nanoparticle. 



combinatorial library of compounds comprising peptoids; random biooligomers; 
25 diversomers such as hydantoins, benzodiazepines and dipeptides; vinylogous polypeptides; 
nonpeptidal peptidomimetics; oligocarbamates; peptidyl phosphonates; peptide nucleic acid 
libraries; antibody libraries; carbohydrate libraries; or small organic molecule libraries. 



8. 



The method of claim 1^ 2, 3, 4 or 5, whereia the library of 



10. The method of claim 1, 2, 3, 4 or 5, wherein the compound is a 
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1 1 . The method of claim 1 0, wherein the small organic molecule libraries 
are libraries of benzodiazepines, isoprenoids, thiazoUdinones, metathiazanones, 
pyrrolidines, morpholino compounds, or diazepindiones. 

12. The method of claim 1, 2, 3, 4 or 5, wherein the detectably labeled 
target RNAxompound complex is detected by electrophoresis, fluorescence spectroscopy, 
surface plasmon resonance, mass spectrometry, scintillation, proximity assay, stracture- 
activity relationships ("SAR") by NMR spectroscopy, size exclusion chromatography, 
afiBnity chromatography, or nanoparticle aggregation. 

13. The method of claim 1, 2, 3, 4 or 5, wherem the method further 
comprises determining the stmcture of the compound. 

14. The method of claim 13, wherein the structure of the compound is 
determined by mass spectroscopy, NMR, X-ray crystallography, Edman degradation or 
vibration spectroscopy. 

15. The method of claim 1, 2, 3, 4 or 5, wherein flie premature stop 
codon is UAG, UGA or UAA. 
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Figure 1 
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Figure 2 
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Figure 4 
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Figures 
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Figure 6 
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Figure 7 




7. DMSO 

8. Paromomycin 

9. Compound A (A, B, E, F, G, H) 

10. Compound B (A, B, E. F, G, H) 

11. Compound C (C, D) 

12. Compound D (A, B, E, F, G, H) 
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Figure 8 




9. Compound E (J) 

10. Compound F (J) 

11. Compound G (I) 

12. Compound H (J) 
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Figure 9 
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Figure 10 
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Figure 11 
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SEQUENCE LISTING 
<110> PTC Therapeutics, Inc. 

<120> Methods for Identifying Small Molecules That Modulate Premature 
Translation Termination and Nonsense Mediated mRNA Decay 

<130> 10589-044-22 8 

<140> 
<141> 

<160> 35 

<170> Patentin version 3.2 

<210> 1 

<211> 27 

<212> RNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: I6s rRNA A site 
<400> 1 

ggcgucacac cuucggguga agucgcc 27 

<210> 2 

<211> 29 

<212> RNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: 16s rRNA A site 
<400> 2 

ggcagaucug agccugggag cucucugcc 29 

<210> 3 

<211> 12 

<212> PRT 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence; 16s rRNA A site 
<400> 3 

Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Pro 
15 10 



<210> 4 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 4 

aaagcctaca gcaccc 

<210> 5 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 5 

tactgaggga atcctgg 

<210> 6 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 6 

ttaccacccg ctttgggggg ggcgggaaag atcc 

<210> 7 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 7 

99gggcggga aagatcc 

<210> 8 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 8 

ccccgagcca ccttccc 



<210> 9 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 9 

ggccccggga ttcggcg 

<210> 10 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 10 

cactggggac agtccgc 

<210> 11 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<4Q0> 11 

cgcggcgggc gagacggg 

<210> 12 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 12 

gagggaaact tcggaggg 

<210> 13 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 13 

catcgggcgc cttaaccc 

<210> 14 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 14 
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cgacgcacac cacacgc 

<210> 15 

<21l> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 15 

ccaagatctg cacctgc 

<210> 16 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 16 

ttaccgcact ggacgcc 

<210> 17 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 17 

gccagaggct gttcacc 

<210> 18 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 18 

tggggaggga gcgagcggcg 

<210> 19 * 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 19 

aagggcccgg ctcgcgtcc 
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<210> 20 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> Description of Artificial Sequence: Primer 

<400> 20 

agggcggggg gacgaaccgc 



<210> 21 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 

<400> 21 

ttaaacagtc ggattcccct gg 22 



<210> 22 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 

<400> 22 

ttcatccatt catgcgcg 

<210> 23 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 

<400> 23 

agtagtggta tttcaccgg 



<210> 24 

<211> 17 

<212> DNA. 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 



<400> 24 

acgggaggtt tctgtcc 



17 
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<210> 25 

<2ll> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 25 

acaatgatag gaagagccg 

<210> 26 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 26 

aggcgttcag tcataatccc 

<210> 27 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 27 

tccgcaccgg accccggtcc 

<210> 28 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 28 

gggctagttg attcggcagg tgagttg 

<210> 29 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 29 

tctccggaat cgaaccct 
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<210> 30 

<211> 18 

<212> DNA 

<2a3> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 



<400> 30 

attaccgcgg ctgctggc 

<210> 31 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 31 

ttggcaaatg ctttcgc 

<210> 32 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence; Primer 
<400> 32 

ccgtcaattc ctttaagttt c 

<210> 33 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 33 

agggcatcac agacctgtta t 

<210> 34 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 34 

cgacgggcgg tgtgtac 



18 
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21 



21 
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<210> 35 
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<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 35 

ccgcaggttc acctacgg 

18 



